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Lyme Disease Vaccines 



Field of the Invention 

The present invention relates to novel vaccines for the prevention or attenuation of Lyme 
disease- The invention further relates to isolated nucleic acid molecules encoding antigenic 
10 polypeptides of Borrelia burgdorferi. Antigenic polypeptides are also provided, as are vectors, 
host cells and recombinant methods for producing the same. The invention additionally relates to 
diagnostic methods for detecting Borrelia gene expression. 

15 Background of the Invention 

Lyme disease (Steere, A.C., Proc. Natl Acad. Sci, USA 97:2378-2383 (1991)), or Lyme 
borreliosis, is presently the most common human disease in the United States transmitted by an 
arthropod vector (Center for Disease Control, Morbid. Mortal. Weekly Rep. 46(23 ):531'535 

20 (1997)). Further, infection of house-hold pets, such as dogs, is a considerable problem. 

While initial symptoms often include a rash at the infection point, Lyme disease is a 
multisystemic disorder that may include arthritic, carditic, and neurological manifestations. While 
antibiotics are currently used to treat active cases of Lyme disease, B. burgdorferi persists even 
after prolonged antibiotic treatment. Further, B. burgdorferi can persist for years in a mammalian 

25 host in the presence of an active immune response (Straubinger, R. et al, J. Clin. Microbiol. 
35:111-116 (1997); Steere, A., N. Engl. J. Med. 527:586-596 (1989)). 

Lyme disease is caused by the related tick-bome spirochetes classified as Borrelia 
burgdorferi sensu lato (including B. burgdorferi sensu stricto, B. afzelii, B. garinii). Although 
substantial progress has been made in the biochemical, ultrastmctural, and genetic characterization 

30 of the organism, the spirochetal factors responsible for infectivity, immune evasion and disease 
pathogenesis remain largely obscure. 

A number of antigenic B. burgdorferi cell surface proteins have been identified. These 
include the outer membrane surface proteins (Osp) OspA, OspB, OspC and OspD. OspA and 
OspB are encoded by tightly linked tandem genes which are transcribed as a single transcriptional 

35 unit (Brusca, J. et al., J. Bacteriol. 773:8004-8008 (1991)). The most-studied B. burgdorferi 
membrane protein is OspA, a lipoprotein antigen expressed by borreliae in resting ticks and the 
most abundant protein expressed in vitro by most borrelial isolates (Barbour, A.G., et al., 
Infection & Immunity 47:795-804 (1983); Howe, T.R., et al. Science 227\(A5 (1985)). 
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A number of different types of Lyme disease vaccines have been shown to induce 
immunological responses. Whole-cell B, burgdorferi vaccines, for example, have been shown to 
induce both immunological responses and protective immunity in several animal models 
(Reviewed in Wormser, G., Clin, Infect. Dis, 27:1267-1274 (1995)). Further, passive immunity 
5 has been demonstrated in both humans and other animals using burgdorferi specific antisera. 

While whole-cell Lyme disease vaccines confer protective immunity in animal models, use 
of such vaccines presents the risk that responsive antibodies will produce an autoimmune 
response (Reviewed in Wormser, G., supra). This problem is at least partly the resuh of the 
production of B, burgdorferi specific antibodies which cross-react with hepatocytes and both 

10 muscle and nerve cells. B, burgdorferi heat shock proteins and the 41-kd flagellin subunit are 
believed to contain antigens which elicit production of these cross-reactive antibodies. 

Single protein subunit vaccines for Lyme disease have also been tested. The cell surface 
proteins of B, burgdorferi are potential candidates for use in such vaccines and several have been 
shown to elicit protective immune responses in mammals (Probert, W. et al. Vaccine J 5: 15-19 

15 (1997); Fikrig, E. et al, Infect, Immun, (55:1658-1662 (1995); Langerman S. et aL, Nature 

572:552-556 (1994); Fikrig, E. et aL, 7, Immunol. 148:2256-2260 (1992)). Experimental OspA 
vaccines, for example, have demonstrated efficacy in several animal models (Fikrig, E., etaL, 
Proc. Natl Acad. ScL USA 89:5418-5421 (1992); Johnson, B.J., et al. Vaccine 75:1086-1094 
(1996); Fikrig, E., et ai. Infect. Immun. 60:657-661 (1992); Chang, Y.F., et aL Infection & 

20 Immunity 65:3543-3549 (1995)), and OspA vaccines for human use are under clinical evaluation 
(Keller, D., et al, J. Am. Med. Assoc. 277:1764-1768 (1994); Van Hoecke, C, et al. Vaccine 
74:1620-1626 (1996)). Passive immunity is also conferred by antisera containing antibodies 
specific for the full-length OspA protein. Further, vaccination with plasmid DNA encoding OspA 
has been demonstrated to elicit protective immune responses in mice (Luke, C. et aL, J. Infect. 

25 Dis. 775:91-97 (1997); Zhong, W. et aL, Eur J. ImmunoL 26:2749-2757 (1996)). 

Recent immunofluorescence assay observations indicate that during tick engorgement the 
expression of OspA by borreliae diminishes (deSilva, A.M., et aL, J. Exp. Med. 755:271-275 
(1996)) while expression of other proteins, exemplified by OspC, increases (Schwan, T.G., et 
aL, Proc. NatL Acad. ScL USA 92:2909-2913 (1985)). By the time of transmission to hosts, 

30 spirochetes in the tick salivary glands express little or no OspA. This down-modulation of OspA 
appears to explain the difficulties in demonstrating immune responses to this antigen early in 
infection following tick bites (Kalish, R.A., et aL, Infect. Immun. 65:2228-2235 (1995); Gem, 
L., etal., J. Infect. Dis. 767:971-975 (1993); Schiable, U.E., et aL, ImmunoL Lett. 56:219-226 
(1993)) or following challenge with limiting doses of cultured borreliae (Schiable, U.E., et aL, 

35 ImmunoL Lett. 56:219-226 (1993); Barthold, S.W. and Bockenstedt, L.K., Infect. Immun. 
67:4696-4702 (1993)). 

Furthermore, OspA-specific antibodies are ineffective if administered after a borrelial 
challenge delivered by syringe (Schiable, U.E., et aL, Proc. NatL Acad. ScL USA 87:3768-3772 
(1990)) or tick bite (deSilva, A.M., et aL, J. Exp. Med. 755:271-275 (1996)). To be efficacious. 
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OspA vaccines must elicit protective levels of antibody which are maintained throughout periods 
of tick exposure in order to block borrelia transmission from the arthropod vector. 

Vaccines in current use against other pathogens include in v/vo-expressed antigens which 
could boost anamnestic responses upon infection, potentiate the action of inmiune effector cells 
5 and complement, and inhibit key virulence mechanisms. OspC is both expressed during infection 
(Montgomery, R.R., etaL Exp, Med, 753:261-269 (1996)) and a target for protective 
immunity (Gilmore, R.D., et ai, Infect. Immun. 64:2234-2239 (1996); Probert, W.S. and 
LeFebvre, R.B., Infect Immun, 62:1920-1926 (1994); Preac-Mursic, V., et aL, Infection 
20:342-349 (1992)), but mice immunized with this protein were only protected against challenge 
10 with the homologous borrelial isolate (Probert, W.S., et al, 7. Infect, Dis, 775:400-405 (1997)). 
Identification of in v/v<?-expressed, and broadly protective, antigens of B, burgdorferi has 
remained elusive. 

Summary of the Invention 

15 The present invention provides isolated nucleic acid molecules comprising polynucleotides 

encoding the B, burgdorferi peptides having the amino acid sequences shown in Table 1. Thus, 
one aspect of the invention provides isolated nucleic acid molecules comprising polynucleotides 
having a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence 
encoding any of the amino acid sequences of the full-length polypeptides shown in Table 1; (b) a 

20 nucleotide sequence encoding any of the amino acid sequences of the full-length polypeptides 
shown in Table 1 but minus the N-terminal methionine residue, if present; (c) a nucleotide 
sequence encoding any of the amino acid sequences of the truncated polypeptides shown in Table 
1; and (d) a nucleotide sequence complementary to any of the nucleotide sequences in (a), (b), or 
(c) above. 

25 Further embodiments of the invention include isolated nucleic acid molecules that 

comprise a polynucleotide having a nucleotide sequence at least 90% identical, and more 
preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide sequences in 
(a), (b), (c), or (d) above, or a polynucleotide which hybridizes under stringent hybridization 
conditions to a polynucleotide in (a), (b), (c), or (d) above. This polynucleotide which hybridizes 

30 does not hybridize under stringent hybridization conditions to a polynucleotide having a 

nucleotide sequence consisting of only A residues or of only T residues. Additional nucleic acid 
embodiments of the invention relate to isolated nucleic acid molecules comprising polynucleotides 
which encode the amino acid sequences of epitope-bearing portions of a B. burgdorferi 
polypeptide having an amino acid sequence in (a), (b), or (c) above. 

35 The present invention also relates to recombinant vectors, which include the isolated 

nucleic acid molecules of the present invention, and to host cells containing the recombinant 
vectors, as well as to methods of making such vectors and host cells and for using these vectors 
for the production of B. burgdorferi polypeptides or peptides by recombinant techniques. 

The invention further provides isolated B. burgdorferi polypeptides having an amino acid 
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sequence selected from the group consisting of: (a) an amino acid sequence of any of the full- 
length polypeptides shown in Table 1 ; (b) an amino acid sequence of any of the full-length 
polypeptides shown in Table 1 but minus the N-terminal methionine residue, if present; (c) an 
amino acid sequence of any of the truncated polypeptides shown in Table 1 ; and (d) an amino acid 
5 sequence of an epitope-bearing portion of any one of the polypeptides of (a), (b), or (c). 

The polypeptides of the present invention also include polypeptides having an amino acid 
sequence with at least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 
96%, 97%, 98%, or 99% similarity to those described in (a), (b), (c), or (d) above, as well as 
polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% 
10 identical, and still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 
those above; as well as isolated nucleic acid molecules encoding such polypeptides. 

The present invention further provides a vaccine, preferably a multi-component vaccine 
comprising one or more of the B, burgdorferi polypeptides shown in Table 1, or fragments 
thereof, together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the 
15 B, burgdorferi poiypeptide(s) are present in an amount effective to elicit an immune response to 
members of the Borrelia genus in an animal. The B. burgdorferi polypeptides of the present 
invention may further be combined with one or more immunogens of one or more other borrelial 
or non-borrelial organisms to produce a multi-component vaccine intended to elicit an 
immunological response against members of the Borrelia genus and, optionally, one or more non- 
20 borrelial organisms. 

The vaccines of the present invention can be administered in a DNA form, e.g., "naked" 
DNA, wherein the DNA encodes one or more borrelial polypeptides and, optionally, one or more 
polypeptides of a non-borrelial organism. The DNA encoding one or more polypeptides may be 
constructed such that these polypeptides are expressed fusion proteins. 
25 The vaccines of the present invention may also be administered as a component of a 

genetically engineered organism. Thus, a genetically engineered organism which expresses one 
or more B, burgdorferi polypeptides may be administered to an animal. For example, such a 
genetically engineered organism may contain one or more B, burgdorferi polypeptides of the 
present invention intracellularly, on its cell surface, or in its periplasmic space. Further, such a 
30 genetically engineered organism may secrete one or more B, burgdorferi polypeptides. 

The vaccines of the present invention may be co-administered to an animal with an 
inmiune system modulator (e.g., CD86 and GM-CSF). 

The invention also provides a method of inducing an immunological response in an animal 
to one or more members of the Borrelia genus, e.g., B. burgdorferi sensu stricto, B, afzelii, and 
35 B. garinii^ comprising administering to the animal a vaccine as described above. 

The invention further provides a method of inducing a protective immune response in an 
animal, sufficient to prevent or attenuate an infection by members of the Borrelia genus, 
comprising administering to the animal a composition comprising one or more of the polypeptides 
shown in Table 1, or fragments thereof. Further, these polypeptides, or fragments thereof, may 



wo 98/59071 



PCT/US98/12718 



5 

be conjugated to another immunogen and/or administered in admixture with an adjuvant. 

The invention further relates to antibodies ehcited in an animal by the administration of one 
or more B. burgdorferi polypeptides of the present invention. 

The invention also provides diagnostic methods for detecting the expression of genes of 
members of the Borrelia genus in an animal. One such method involves assaying for the 
expression of a gene encoding Borrelia peptides in a sample from an animal. This expression 
may be assayed either directly (e.g., by assaying polypeptide levels using antibodies elicited in 
response to amino acid sequences shown in Table 1) or indirectly (e.g., by assaying for 
antibodies having specificity for amino acid sequences shown in Table 1 ). An example of such a 
method involves the use of the polymerase chain reaction (PGR) to amplify and detect Borrelia 
nucleic acid sequences. 

The present invention also relates to nucleic acid probes having all or part of a nucleotide 
sequence shown in Table 1 which are capable of hybridizing under stringent conditions to 
Borrelia nucleic acids. The invention further relates to a method of detecting one or more Borrelia 
nucleic acids in a biological sample obtained from an animal, said one or more nucleic acids 
encoding Borrelia polypeptides, comprising: 

a) contacting the sample with one or more of the above-described nucleic acid probes, 
under conditions such that hybridization occurs, and 

b) detecting hybridization of said one or more probes to the Borrelia nucleic acid present 
in the biological sample. 

Detailed Description 

The present invention relates to recombinant antigenic B, burgdorferi polypeptides and 
fragments thereof. The invention also relates to methods for using these polypeptides to produce 
immunological responses and to confer immunological protection to disease caused by members 
of the genus Borrelia, The invention further relates to nucleic acid sequences which encode 
antigenic B, burgdorferi polypeptides and to methods for detecting Borrelia nucleic acids and 
polypeptides in biological samples. The invention also relates to Borrelia specific antibodies and 
methods for detecting such antibodies produced in a host animal. 

Definitions 

The following definitions are provided to clarify the subject matter which the inventors 
consider to be the present invention. 

As used herein, the phrase "pathogenic agent" means an agent which causes a disease state 
or affliction in an animal. Included within this definition, for examples, are bacteria, protozoans, 
fungi, viruses and metazoan parasites which either produce a disease state or render an animal 
infected with such an organism susceptible to a disease state (e.g., a secondary infection). 
Further included are species and strains of the genus Borrelia which produce disease states in 
animals. 
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As used herein, the term "organism" means any living biological system, including 
viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term "Borrelia" means any species or strain of bacteria which is 
members of the genus Borrelia, Included within this definition are Borrelia burgdorferi sensu lato 
(including B. burgdorferi sensu stricto, J5. afzeliiy B. garinii), B, andersonii, B. anserina, 5. 
japonica, B. coriaceae, and other members of the genus Borrelia regardless of whether they are 
known pathogenic agents. 

As used herein, the phrase "one or more B, burgdorferi polypeptides of the present 
invention" means the amino acid sequence of one or more of the B. burgdorferi polypeptides 
disclosed in Table 1. These polypeptides may be expressed as fusion proteins wherein the 
B, burgdorferi polypeptides of the present invention are linked to additional amino acid 
sequences which may be of borrelial or non-borrelial origin. This phrase further includes 
fragments of the B, burgdorferi polypeptides of the present invention. 

As used herein, the phrase "full-length amino acid sequence" and "full-length polypeptide" 
refer to an amino acid sequence or polypeptide encoded by a full-length open reading frame 
(ORF), An ORF may be defined as a nucleotide sequence bounded by stop codons which 
encodes a putative polypeptide. An ORF may also be defined as a nucleotide sequence within a 
stop codon bounded sequence which contains an initiation codon (e.g,^ a methionine or valine 
codon) on the 5* end and a stop codon on the 3' end. 

As used herein, the phrase "truncated amino acid sequence" and "truncated polypeptide" 
refer to a sub-sequence of a full-length amino acid sequence or polypeptide. Several criteria may 
also be used to define the truncated amino acid sequence or polypeptide. For example, a truncated 
polypeptide may be defined as a mature polypeptide (e.g., a polypeptide which lacks a leader 
sequence). A truncated polypeptide may also be defined as an amino acid sequence which is a 
portion of a longer sequence that has been selected for ease of expression in a heterologous 
system but retains regions which render the polypeptide useful for use in vaccines (e.g., antigenic 
regions which are expected to elicit a protective immune response). 

Additional definitions are provided throughout the specification. 
Explanation of Table 1 

Table 1 lists B. burgdorferi nucleotide and amino acid sequences of the present invention. 
The nomenclature used therein is as follows: 

"nt" refers to nucleotide sequences; 

"aa" refers to amino acid sequences; 

"f ' refers to full-length nucleotide or amino acid sequences; and 
"t" refers to truncated nucleotide or amino acid sequences. 

Thus, for example, the designation "flOl.aa" refers to the full-length amino acid sequence 
of B. burgdorferi polypeptide number 101. Further, "flOl.nt" refers to the full-length nucleotide 
sequence encoding the full-length amino acid sequence of B. burgdorferi polypeptide number 
101. 
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Explanation of Table 2 

Table 2 lists accession numbers for the closest matching sequences between the 
polypeptides of the present invention and those available through GenBank and GeneSeq 
5 databases. These reference numbers are the database entry numbers commonly used by those of 
skill in the art, who will be familar with their denominations. The descriptions of the 
numenclature for GenBank are available from the National Center for Biotechnology Information. 
Column 1 lists the gene or ORF of the present invention. Column 2 lists the accession number of 
a "match" gene sequence in GenBank or GeneSeq databases. Column 3 lists the description of 

10 the "match" gene sequence. Colunms 4 and 5 are the high score and smallest sum probability, 
respectively, calculated by BLAST. Polypeptides of the present invention that do not share 
significant identity/similarity with any polypeptide sequences of GenBank and GeneSeq are not 
represented in Table 2. Polypeptides of the present invention that share significant 
identity/similarity with more than one of the polypeptides of GenBank and GeneSeq are 

15 represented more than once. 

Explanation of Table 3. 

The B. burgdorferi polypeptides of the present invention may include one or more conservative 
amino acid substitutions from natural mutations or human manipulation as indicated in Table 3. 
20 Changes are preferably of a minor nature, such as conservative amino acid substitutions that do 
not significantly affect the folding or activity of the protein. Residues from the following groups, 
as indicated in Table 3, may be substituted for one another: Aromatic, Hydrophobic, Polar, Basic, 
Acidic, and Small, 

25 Explanation of Table 4 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope-bearing fragments 
present in each of the full length B. burgdorferi polypeptides described in Table 1 as predicted by 
the inventors using the algorithm of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4: 181-186. 
The Jameson-Wolf antigenic analysis was performed using the computer program PROTEAN 

30 (Version 3. 1 1 for the Power Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, 
WI). B, burgdorferi polypeptide shown in Table 1 may one or more antigenic epitopes 
comprising residues described in Table 4. It will be appreciated that depending on the analytical 
criteria used to predict antigenic determinants, the exact address of the determinant may vary 
slighdy. The residues and locations shown described in Table 4 correspond to the amino acid 

35 sequences for each full length gene sequence shown in Table 1 and in the Sequence Listing. 
Polypeptides of the present invention that do not have antigenic epitopes recognized by the 
Jameson- Wolf algorithm are not represented in Table 2. 
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Selection of Nucleic Acid Sequences Encoding Antigenic B. burgdorferi 
Polypeptides 

The present invention provides a select number of ORFs from those presented in the 
fragments of the Borrelia burgdorferi genome which may prove useful for the generation of a 
5 protective immune response. The sequenced B. burgdorferi genomic DNA was obtained frona a 
subK:ultured isolate of ATCC Deposit No. 35210. The sub-cultured isolate was deposited on 
August 8, 1997 at the American Type Culture Collection, 12301 Park Lawn Drive, Rockville, 
Maryland 20852, and given accession number 202012. 

Some ORFs contained in the subset of fragments of the B, burgdorferi genome disclosed 
10 herein were derived through the use of a number of screening criteria detailed below. The ORFs 
are generally bounded at the amino terminus by a methionine residue and at the carboxy terminus 
by a stop codon. 

Many of the selected sequences do not consist of complete ORFs. Although a polypeptide 
representing a complete ORF may be the closest approximation of a protein native to an organism, 

15 it is not always preferred to express a complete ORF in a heterologous system. It may be 

challenging to express and purify a highly hydrophobic protein by conmion laboratory methods. 
Some of the polypeptide vaccine candidates described herein have been modified slightly to 
simplify the production of recombinant protein. For example, nucleotide sequences which encode 
highly hydrophobic domains, such as those found at the amino terminal signal sequence, have 

20 been excluded from some constructs used for in vitro expression of the polypeptides. 

Furthermore, any highly hydrophobic amino acid sequences occurring at the carboxy terminus 
have also been excluded from the recombinant expression constructs. Thus, in one embodiment, 
a polypeptide which represents a truncated or modified ORF may be used as an antigen. 

While numerous methods are known in the art for selecting potentially immunogenic 

25 polypeptides, many of the ORFs disclosed herein were selected on the basis of screening all 

theoretical Borrelia burgdorferi ORFs for several aspects of potential immunogenicity. One set of 
selection criteria are as follows: 

1 . Type I signal sequence: An amino terminal type I signal sequence generally directs a 
nascent protein across the plasma and outer membranes to the exterior of the bacterial cell. 

30 Experimental evidence obtained from studies with Escherichia coli suggests that the typical type I 
signal sequence consists of the following biochemical and physical attributes (Izard, J. W. and 
Kendall, D. A. Mol Microbiol 75:765-773 (1994)). The length of the type I signal sequence is 
approximately 15 to 25 primarily hydrophobic amino acid residues with a net positive charge in 
the extreme amino terminus. In addition, the central region of the signal sequence adopts an 

35 alpha-helical conformation in a hydrophobic environment. Finally, the region surrounding the 
actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1 and 
-3 positions. 

2. Type IV signal sequence: The type IV signal sequence is an example of the several 
types of functional signal sequences which exist in addition to the type I signal sequence detailed 
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above. Although functionally related, the type IV signal sequence possesses a unique set of 
biochemical and physical attributes (Strom, M. S. and Lory, S., J, Bacterial 774:7345-7351 
(1992)). These are typically six to eight amino acids with a net basic charge followed by an 
additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal 
5 sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In 
addition, type IV signal sequences generally contain a phenylalanine residue at the +1 site relative 
to the cleavage site. 

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors 
has allowed the definition of a consensus amino acid sequence for lipoprotein cleavage. Nearly 

10 three-fourths of the bacterial lipoprotein precursors examined contained the sequence L-(A,S)- 
(G,A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu, H. C, 7. 
Bioenerg. Biomembr. 22'A5\-A1\ (1990)). 

4. LPXTG motif: It has been experimentally determined that most anchored proteins 
found on the surface of gram-positive bacteria possess a highly conserved carboxy terminal 

15 sequence. More than fifty such proteins from organisms such as 5. pyogenes, S. mutans^ B. 

burgdorferi, S. pneumoniae^ and others, have been identified based on their extracellular location 
and carboxy terminal amino acid sequence (Fischetti, V. A,, ASM News 62:405-410 (1996)). 
The conserved region consists of six charged amino acids at the extreme carboxy terminus 
coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain. 

20 Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in 

nearly all proteins examined. The amino acid sequence of this region is L-P-X-T-G-X, where X 
is any amino acid. 

An algorithm for selecting antigenic and immunogenic Borrelia burgdorferi polypeptides 
including the foregoing criteria was developed. The algorithm is similar to that described in U.S. 
25 patent application 08/781,986, filed January 3, 1997, which is fully incorporated by reference 
herein. Use of the algorithm by the inventors to select immunologically useful Borrelia 
burgdorferi polypeptides resulted in the selection of a number of the disclosed ORFs. 
Polypeptides comprising the polypeptides identified in this group may be produced by techniques 
standard in the art and as further described herein. 

30 

Nucleic Acid Molecules 

The present invention provides isolated nucleic acid molecules comprising polynucleotides 
encoding the B, burgdorferi polypeptides having the amino acid sequences shown in Table 1 , 
which were determined by sequencing the genome of B. burgdorferi deposited as ATCC deposit 
35 no. 202012 and selected as putative inmiunogens. 

Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA 
molecule herein were determined using an automated DNA sequencer (such as the Model 373 
from Applied Biosystems, Inc.), and all amino acid sequences of polypeptides encoded by DNA 
molecules determined herein were predicted by translation of DNA sequences determined as 
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above. Therefore, as is known in the art for any DNA sequence determined by this automated 
approach, any nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, more typically at 
least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the 
5 sequenced DNA molecule. The actual sequence can be more precisely determined by other 

approaches including manual DNA sequencing methods well known in the art. As is also known 
in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual 
sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted 
amino acid sequence encoded by a determined nucleotide sequence will be completely different 

10 from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the 
point of such an insertion or deletion. 

Unless otherwise indicated, each "nucleotide sequence" set forth herein is presented as a 
sequence of deoxyribonucleotides (abbreviated A, G , C and T). However, by "nucleotide 
sequence" of a nucleic acid molecule or polynucleotide is intended, for a DNA molecule or 

15 polynucleotide, a sequence of deoxyribonucleotides, and for an RNA molecule or polynucleotide, 
the corresponding sequence of ribonucleotides (A, G, C and U), where each thymidine 
deoxyribonucleotide (T) in the specified deoxyribonucleotide sequence is replaced by the 
ribonucleotide uridine (U). For instance, reference to an RNA molecule having a sequence of 
Table 1 set forth using deoxyribonucleotide abbreviations is intended to indicate an RNA molecule 

20 having a sequence in which each deoxyribonucleotide A, G or C of Table 1 has been replaced by 
the corresponding ribonucleotide A, G or C, and each deoxyribonucleotide T has been replaced 
by a ribonucleotide U. 

Nucleic acid molecules of the present invention may be in the form of RNA, such as 
mRNA, or in the form of DNA, including, for instance, cDN A and genomic DNA obtained by 

25 cloning or produced synthetically. The DNA may be double-stranded or single-stranded. 

Single-stranded DNA or RNA may be the coding strand, also known as the sense strand, or it 
may be the non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or RNA, 
which has been removed from its native environment. For example, recombinant DNA molecules 

30 contained in a vector are considered isolated for the purposes of the present invention. Further 
examples of isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated 
RNA molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the present 
invention. Isolated nucleic acid molecules according to the present invention further include such 

35 molecules produced synthetically. 

In addition, isolated nucleic acid molecules of the invention include DNA molecules which 
comprise a sequence substantially different from those described above but which, due to the 
degeneracy of the genetic code, still encode a B. burgdorferi polypeptides and peptides of the 
present invention (e.g. polypeptides of Table 1). That is, all possible DNA sequences that encode 
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the B. burgdorferi polypeptides of the present invention. This includes the genetic code and 
species-specific codon preferences known in the art. Thus, it would be routine for one skilled in 
the art to generate the degenerate variants described above, for instance, to optinii2;e codon 
expression for a particular host (e.g., change codons in the bacteria mRNA to those preferred by a 
5 mammalian or other bacterial host such as E, coli). 

The invention further provides isolated nucleic acid molecules having the nucleotide 
sequence shown in Table 1 or a nucleic acid molecule having a sequence complementary to one of 
the above sequences. Such isolated molecules, particularly DNA molecules, are useful as probes 
for gene mapping and for identifying B. burgdorferi in a biological sample, for instance, by PCR, 

10 Southern blot, Northem blot, or other form of hybridization analysis. 

The present invention is further directed to nucleic acid molecules encoding portions or 
fragments of the nucleotide sequences described herein. Fragments include portions of the 
nucleotide sequences of Table 1 at least 10 contiguous nucleotides in length selected from any two 
integers, one of which representing a 5' nucleotide position and a second of which representing a 

15 3* nucleotide position, where the first nucleotide for each nucleotide sequence in Table 1 is 

position 1. That is, every combination of a 5' and 3' nucleotide position that a fragment at least 
10 contiguous nucleotides in length could occupy is included in the invention. "At least" means a 
fragment may be 10 contiguous nucleotide bases in length or any integer between 10 and the 
length of an entire nucleotide sequence of Table 1 minus 1. Therefore, included in the invention 

20 are contiguous fragments specified by any 5' and 3' nucleotide base positions of a nucleotide 

sequences of Table 1 wherein the contiguous fragment is any integer between 10 and the length of 
an entire nucleotide sequence minus 1. 

Further, the invention includes polynucleotides comprising fragments specified by size, in 
nucleotides, rather than by nucleotide positions. The invention includes any fragment size, in 

25 contiguous nucleotides, selected from integers between 10 and the length of an entire nucleotide 
sequence minus 1. Preferred sizes of contiguous nucleotide fragments include 20 nucleotides, 30 
nucleotides, 40 nucleotides, 50 nucleotides. Other preferred sizes of contiguous nucleotide 
fragments, which may be useful as diagnostic probes and primers, include fragments 50-300 
nucleotides in length which include, as discussed above, fragment sizes representing each integer 

30 between 50-300. Larger fragments are also useful according to the present invention 

corresponding to most, if not all, of the nucleotide sequences shown in Table lor of the B, 
burgdorferi nucleotide sequences of the plasimd clones listed in Table 1 . The preferred sizes are, 
of course, meant to exemplify not limit the present invention as all size fragments, representing 
any integer between 10 and the length of an entire nucleotide sequence minus 1, are included in 

35 the invention. Additional preferred nucleic acid fragments of the present invention include 
nucleic acid molecules encoding epitope-bearing portions of B, burgdorferi polypeptides 
identified in Table 4. 

The present invention also provides for the exclusion of any fragment, specified by 5' and 
3' base positions or by size in nucleotide bases as described above for any nucleotide sequence of 
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Table 1 or the plasimd clones listed in Table 1 . Any number of fragments of nucleotide sequences 
in Table 1 or the plasimd clones listed in Table 1, specified by 5* and 3' base positions or by size 
in nucleotides, as described above, may be excluded from the present invention. 

Preferred nucleic acid fragments of the present invention also include nucleic acid 
5 molecules encoding epitope-bearing portions of the B, burgdorferi polypeptides shown in Table 
1. Such nucleic acid fragments of the present invention include, for example, nucleic acid 
molecules encoding polypeptide fragments comprising from about the amino terminal residue to 
about the carboxy terminal residue of each fragment shown in Table 4. The above referred to 
polypeptide fragments are antigenic regions of particular B. burgdorferi polypeptides shown in 

10 Table 1 . Methods for determining other such epitope-bearing portions for the remaining 

polypeptides described in Table 1 are well known in the art and are described in detail below. 

In another aspect, the invention provides isolated nucleic acid molecules comprising 
polynucleotides which hybridize under stringent hybridization conditions to a portion of a 
polynucleotide in a nucleic acid molecule of the invention described above, for instance, a nucleic 

15 acid sequence shown in Table 1, By "stringent hybridization conditions" is intended ovemight 
incubation at 42 C in a solution comprising: 50% formamide, 5x SSC (150 mM NaCl, 15 mM 
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran 
sulfate, and 20 g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 
O.lx SSC at about 65 C. 

20 By polynucleotides which hybridize to a "portion" of a polynucleotide is intended 

polynucleotides (either DNA or RNA) which hybridize to at least about 15 nucleotides (nt), and 
more preferably at least about 20 nt, still more preferably at least about 30 nt, and even more 
preferably about 30-70 nt of the reference polynucleotide. These are useful as diagnostic probes 
and primers as discussed above and in more detail below. 

25 Of course, polynucleotides hybridizing to a larger portion of the reference polynucleotide, 

for instance, a portion 50-100 nt in length, or even to the entire length of the reference 
polynucleotide, are also useful as probes according to the present invention, as are 
polynucleotides corresponding to most, if not all, of a nucleotide sequence as shown in Table 1 . 
By a portion of a polynucleotide of "at least 20 nt in length," for example, is intended 20 or more 

30 contiguous nucleotides from the nucleotide sequence of the reference polynucleotide (e.g., a 
nucleotide sequences as shown in Table 1). As noted above, such portions are useful 
diagnostically either as probes according to conventional DNA hybridization techniques or as 
primers for amplification of a target sequence by PCR, as described, for instance, in Molecular 
Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J., Fritsch, E. F. and Maniatis, T., 

35 eds.. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989), the entire 
disclosure of which is hereby incorporated herein by reference. 

Since nucleic acid sequences encoding the B, burgdorferi polypeptides of the present 
invention are provided in Table 1, generating polynucleotides which hybridize to portions of these 
sequences would be routine to the skilled artisan. For example, the hybridizing polynucleotides 



wo 98/59071 PCTAJS98/12718 

13 

of the present invention could be generated synthetically according to known techniques. 

As indicated, nucleic acid molecules of the present invention which encode B, burgdorferi 
polypeptides of the present invention may include, but are not limited to those encoding the amino 
acid sequences of the polypeptides by themselves; and additional coding sequences which code 
for additional amino acids, such as those which provide additional functionalities. Thus, the 
sequences encoding these polypeptides may be fused to a marker sequence, such as a sequence 
encoding a peptide which facilitates purification of the fused polypeptide. In certain preferred 
embodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), among others, many of which 
are commercially available. As described in Gentz etaL, Proc. Natl Acad, ScL USA 56:821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the resulting fusion 
protein. 

Thus, the present invention also includes genetic fusions wherein the 
B, burgdorferi nucleic acid sequences coding sequences provided in Table 1 are linked to 
additional nucleic acid sequences to produce fusion proteins. These fusion proteins may include 
epitopes of borrelial or non-borrelial origin designed to produce proteins having enhanced 
inununogenicity. Further, the fusion proteins of the present invention may contain antigenic 
determinants known to provide helper T-cell stimulation, peptides encoding sites for 
post-translational modifications which enhance immunogenicity (e.g.^ acylation), peptides which 
facilitate purification (e.g., histidine "tag"), or amino acid sequences which target the fusion 
protein to a desired location (e.g., a heterologous leader sequence). For instance, hexa-histidine 
provides for convenient purification of the fusion protein. See Gentz et al. (1989) Proc. Natl. 
Acad. Sci, 86:821-24. The "HA" tag is another peptide useful for purification which corresponds 
to an epitope derived from the influenza hemagglutinin protein. See Wilson et al. (1984) Cell 
37:767. As discussed below, other such fusion proteins include the B, burgdorferi polypeptides 
of the present invention fused to Fc at the N- or C-terminus. 

Post-translational modification of the full-length B. burgdorferi OspA protein 
expressed in E, coli is believed to increase the immunogenicity of this protein. Erdile, L. et al, 
Infect Immun, 67:81-90 (1993), B. burgdorferi OspA when expressed in £. coli, for example, 
is post-translationally modified in at least two ways. First, a signal peptide is cleaved; second, 
lipid moieties are attached. The presence of these lipid moieties is believed to confer enhanced 
immunogenicity and results in the elicitation of a strong protective immunological response. 

Variant and Mutant Polynucleotides 

The present invention thus includes nucleic acid molecules and sequences which encode 
fusion proteins comprising one or more B, burgdorferi polypeptides of the present invention 
fused to an amino acid sequence which allows for post-translational modification to enhance 
immunogenicity. This post-translational modification may occur either in vitro or when the fusion 
protein is expressed in vivo in a host cell. An example of such a modification is the introduction 
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of an amino acid sequence which results in the attachment of a lipid moiety. Such a lipid moiety 
attachment site of OspA, which is lipidated upon expression in E. coli, has been identified. 
Bouchon, B. et aL, Anal Biochem. 246:52-61 (1997). 

Thus, as indicated above, the present invention includes genetic fusions wherein a 
5 B. burgdorferi nucleic acid sequence provided in Table 1 is linked to a nucleotide sequence 
encoding another amino acid sequence. These other amino acid sequences may be of borrelial 
origin (e.g,^ another sequence selected from Table 1) or non-borrelial origin. An example of such 
a fusion protein is reported in Fikrig, E. et aL, Science 250:553-556 (1990) where an OspA- 
glutathione-5-transferase fusion protein was produced and shown to elicit protective immunity 

10 against Lyme disease in immune competent mice. 

The present invention further relates to variants of the nucleic acid molecules of the present 
invention, which encode portions, analogs or derivatives of the B. burgdorferi polypeptides 
shown in Table 1. Variants may occur naturally, such as a natural allelic variant. By an "allelic 
variant" is intended one of several alternate forms of a gene occupying a given locus on a 

15 chromosome of an organism. Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985). 
Non-naturally occurring variants may be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions or additions. 
The substitutions, deletions or additions may involve one or more nucleotides. These variants 
may be altered in coding regions, non-coding regions, or both. Alterations in the coding regions 

20 may produce conservative or non-conservative amino acid substitutions, deletions or additions. 
Especially preferred among these are silent substitutions, additions and deletions, which do not 
alter the properties and activities of the B, burgdorferi polypeptides disclosed herein or portions 
thereof. Also especially preferred in this regard are conservative substitutions. 

The present application is further directed to nucleic acid molecules at least 90%, 95%, 

25 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1. The above 

nucleic acid sequences are included irrespective of whether they encode a polypeptide having B. 
burgdorferi activity. This is because even where a particular nucleic acid molecule does not 
encode a pol5^ptide having B. burgdorferi activity, one of skill in the art would still know how 
to use the nucleic acid molecule, for instance, as a hybridization probe. Uses of the nucleic acid 

30 molecules of the present invention that do not encode a polypeptide having B. burgdorferi activity 
include, inter alia, isolating an B. burgdorferi gene or allelic variants thereof from a DNA library, 
and detecting fl. burgdorferi mRNA expression samples, environmental samples, suspected of 
containing B, burgdorferi by Northern Blot analysis. 

Embodiments of the invention include isolated nucleic acid molecules comprising a 

35 polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 
95%, 96%, 97%, 98% or 99% identical to (a) a nucleotide sequence encoding any of the amino 
acid sequences of the full-length polypeptides shown in Table 1 ; (b) a nucleotide sequence 
encoding any of the amino acid sequences of the full-length polypeptides shown in Table 1 but 
minus the N-terminal methionine residue, if present; (c) a nucleotide sequence encoding any of the 
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amino acid sequences of the truncated polypeptides sliown in Table 1 ; and (d) a nucleotide 
sequence complementary to any of the nucleotide sequences in (a), (b), or (c) above. 

Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 97%, 
98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, in fact, encode a 
5 polypeptide having B, burgdorferi protein activity By "a polypeptide having R burgdorferi 
activity" is intended polypeptides exhibiting activity similar, but not necessarily identical, to an 
activity of the B. burgdorferi protein of the invention, as measured in a particular biological assay 
suitable for measuring activity of the specified protein. 

Due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately 

10 recognize that a large number of the nucleic acid molecules having a sequence at least 90%, 95%, 
96%, 97%, 98%, or 99% identical to the nucleic acid sequences shown in Table 1 will encode a 
polypeptide having B. burgdorferi protein activity. In fact, since degenerate variants of these 
nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even 
without performing the above described comparison assay. It will be further recognized in the art 

15 that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will 
also encode a polypeptide having B. burgdorferi protein activity. This is because the skilled 
artisan is fully aware of amino acid substitutions that are either less likely or not likely to 
significantly effect protein function (e.g., replacing one aliphatic amino acid with a second 
aliphatic amino acid), as further described below. 

20 The biological activity or function of the polypeptides of the present invention are expected 

to be similar or identical to polypeptides from other bacteria that share a high degree of structural 
identity/similarity. Tables 2 lists accession numbers and descriptions for the closest matching 
sequences of polypeptides available through Genbank and Derwent databases. It is therefore 
expected that the biological activity or function of the polypeptides of the present invention will be 

25 similar or identical to those polypeptides from other bacterial genuses, species, or strains listed in 
Table 2. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% "identical" to 
a reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence 
of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence 

30 may include up to five point mutations per each 100 nucleotides of the reference nucleotide 

sequence encoding the B, burgdorferi polypeptide. In other words, to obtain a polynucleotide 
having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% 
(5 of 100) of the nucleotides in the reference sequence may be deleted, inserted, or substituted 
with another nucleotide. The query sequence may be an entire sequence shown in Table 1, the 

35 ORF (open reading frame), or any fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least 
90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention 
can be determined conventionally using known computer programs. A preferred method for 
determining the best overall match between a query sequence (a sequence of the present invention) 
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and a subject sequence, also referred to as a global sequence alignment, can be determined using 
the FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et al. 
(1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and subject sequences 
are both DNA sequences. An RNA sequence can be compared by first converting U's to T's. 
5 The result of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k- 
tuple=4. Mismatch Penalty=l, Joining Penalty=30, Randomization Group Length=0, Cutoff 
Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject 
nucleotide sequence, whichever is shorter. 

10 If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, 

not because of internal deletions, a manual correction must be made to the results. This is because 
the FASTDB program does not account for 5' and 3' truncations of the subject sequence when 
calculating percent identity. For subject sequences truncated at the 5' or y ends, relative to the 
query sequence, the percent identity is corrected by calculating the number of bases of the query 

15 sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of 
the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using the specified parameters, to arrive at a 
final percent identity score. This corrected score is what is used for the purposes of the present 

20 invention. Only nucleotides outside the 5' and 3' nucleotides of the subject sequence, as 

displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are 
calculated for the purposes of manually adjusting the percent identity score. 

For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide query 
sequence to determine percent identity. The deletions occur at the 5' end of the subject sequence 

25 and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 

nucleotides at 5' end. The 10 unpaired nucleotides represent 10% of the sequence (number of 
nucleotides at the 5' and 3' ends not matched/total number of nucleotides in the query sequence) 
so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the 
remaining 90 nucleotides were perfectly matched the final percent identity would be 90%. In 

30 another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide query 

sequence. This time the deletions are internal deletions so that there are no nucleotides on the 5' 
or 3' of the subject sequence which are not matched/aligned with the query. In this case the 
percent identity calculated by FASTDB is not manually corrected. Once again, only nucleotides 
5' and 3' of the subject sequence which are not matched/aligned with the query sequence are 

35 manually corrected for. No other manual corrections are to made for the purposes of the present 
invention. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated DNA molecules of 
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the present invention, host cells which are genetically engineered with the recombinant vectors, 
and the production of B. burgdorferi polypeptides or fragments thereof by recombinant 
techniques. 

Recombinant constructs may be introduced into host cells using well known techniques 
5 such as infection, transduction, transfection, transvection, electroporation and transformation. 
The vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral vectors 
may be rephcation competent or replication defective. In the latter case, viral propagation 
generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker for 

10 propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a calcium 
phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it may be 
packaged in vitro using an appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising c/5-acting control regions to the polynucleotide of 
interest. Appropriate rran^-acting factors may be supplied by the host, supplied by a 

15 complementing vector or supplied by the vector itself upon introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for specific 
expression, which may be inducible and/or cell type-specific. Particularly preferred among such 
vectors are those inducible by environmental factors that are easy to manipulate, such as 
temperature and nutrient additives. 

20 Expression vectors useful in the present invention include chromosomal-, episomal- and 

virus-derived vectors, e.g., vectors derived from bacterial plasmids, bacteriophage, yeast 
episomes, yeast chromosomal elements, viruses such as baculoviruses, papova viruses, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors 
derived from combinations thereof, such as cosmids and phagemids. 

25 The DNA insert should be operatively linked to an appropriate promoter, such as the 

phage lambda PL promoter, the E. coli lac, trp and tac promoters, the S V40 early and late 
promoters and promoters of retroviral LTRs, to name a few. Other suitable promoters will be 
known to the skilled artisan. The expression constructs will further contain sites for transcription 
initiation, termination and, in the transcribed region, a ribosome binding site for translation. The 

30 coding portion of the mature transcripts expressed by the constructs will preferably include a 
translation initiating site at the beginning and a termination codon (UAA, UGA or U AG) 
appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one selectable marker. 
Such maricers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture 

35 and tetracycline or ampicillin resistance genes for culturing in E. coli and other bacteria. 

Representative examples of appropriate hosts include, but are not limited to, bacterial cells, such 
as E. coli, Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and 
Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions for the 
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above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, available 
from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNHlSA, 
pNH46A available from Stratagene; pET series of vectors available from Novagen; and ptrc99a, 
pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. Among preferred eukaryotic 
vectors are pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and 
pSVK3, pBPV, pMSG and pS\TL available from Pharmacia. Other suitable vectors will be 
readily apparent to the skilled artisan. 

Among known bacterial promoters suitable for use in the present invention include the E. 
coli lad and lacX promoters, the T3 and T7 promoters, the gpt promoter, the lambda PR and PL 
promoters and the trp promoter. Suitable eukaryotic promoters include the CMV immediate early 
promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters 
of retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and metallothionein 
promoters, such as the mouse metallothionein-I promoter. 

Introduction of the construct into the host cell can be effected by calcium phosphate 
transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection or other methods. Such methods are described in many 
standard laboratory manuals, such as Davis et al, Basic Methods In Molecular Biology (1986). 

Transcription of DNA encoding the polypeptides of the present invention by higher 
eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are 
cw-acting elements of DNA, usually about from 10 to 300 bp that act to increase transcriptional 
activity of a promoter in a given host cell-type. Examples of enhancers include the SV40 
enhancer, which is located on the late side of the replication origin at bp 100 to 270, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication 
origin, and adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the endoplasmic reticulum, 
into the periplasmic space or into the extracellular environment, appropriate secretion signals may 
be incorporated into the expressed polypeptide. The signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as a fusion protein, and may 
include not only secretion signals, but also additional heterologous functional regions. For 
instance, a region of additional amino acids, particularly charged amino acids, may be added to 
the N-terminus of the polypeptide to improve stabihty and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties may be added to 
the polypeptide to facilitate purification. Such regions may be removed prior to final preparation 
of the polypeptide. The addition of peptide moieties to polypeptides to engender secretion or 
excretion, to improve stability and to facilitate purification, among others, are familiar and routine 
techniques in the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize proteins. For example, EP-A-O 464 533 (Canadian 
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counterpart 2045869) discloses fusion proteins comprising various portions of constant region of 
inununoglobin molecules together with another human protein or part thereof. In many cases, the 
Fc part in a fusion protein is thoroughly advantageous for use in therapy and diagnosis and thus 
results, for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 
hand, for some uses it would be desirable to be able to delete the Fc part after the fusion protein 
has been expressed, detected and purified in the advantageous manner described. This is the case 
when Fc portion proves to be a hindrance to use in therapy and diagnosis, for example when the 
fusion protein is to be used as antigen for immunizations. In drug discovery, for example, human 
proteins, such as, hIL5-receptor has been fused with Fc portions for the purpose of 
high-throughput screening assays to identify antagonists of hIL-5. See Bennett, D. et aL, 7. 
Molec, Recogn, 5:52-58 (1995) and Johanson, K. etai, J, BioL Chem. 270 C76j:9459-9471 
(1995). 

The B. burgdorferi polypeptides can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxy lapatite 
chromatography, lectin chromatography and high performance liquid chromatography ("HPLC") 
is employed for purification. Polypeptides of the present invention include naturally purified 
products, products of chemical synthetic procedures, and products produced by recombinant 
techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. 

Polypeptides and Fragments 

The invention further provides isolated polypeptides having the amino acid sequences in 
Table 1, and peptides or polypeptides comprising portions of the above polypeptides. The terms 
"peptide" and "oligopeptide" are considered synonymous (as is commonly recognized) and each 
term can be used interchangeably as the context requires to indicate a chain of at least to amino 
acids coupled by peptidyl linkages. The word "polj^eptide" is used herein for chains containing 
more than ten amino acid residues. All oligopeptide and polypeptide formulas or sequences 
herein are written from left to right and in the direction from amino terminus to carboxy terminus. 

As discussed in detail below, immunization using B. burgdorferi sensu stricto isolate B31 
decorin-binding protein elicits the production of antiserum which confers passive immunity 
against Borrelia species and strains which express divergent forms of this protein. Cassatt, D. et 
aL, Protection of Borrelia burgdorferi Infection by Antibodies to Decorin-binding Protein^ in 
Vaccines97, Cold Spring Harbor Press (1997), pages 191-195. Thus, some amino acid 
sequences of the B. burgdorferi polypeptides shown in Table 1 can be varied without 
significantly effecting the antigenicity of the polypeptides. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the polypeptide which 
determine antigenicity. In general, it is possible to replace residues which do not form part of an 
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antigenic epitope without significantly effecting the antigenicity of a polypeptide. 
Variant and Mutant Polypeptides 

To improve or alter the characteristics of B, burgdorferi polypeptides of the present 
invention, protein engineering may be employed. Recombinant DNA technology known to those 
5 skilled in the art can be used to create novel mutant proteins or muteins including single or 
multiple amino acid substitutions, deletions, additions, or fusion proteins. Such modified 
polypeptides can show, e.g., enhanced activity or increased stability. In addition, they may be 
purified in higher yields and show better solubility than the corresponding natural polypeptide, at 
least under certain purification and storage conditions. 

10 

N'Terminal and C-Terminal Deletion Mutants 

It is known in the art that one or more amino acids may be deleted from the N-terminus or 
C-terminus without substantial loss of biological function. For instance, Ron et al. J. Biol. 
Chem., 268:2984-2988 (1993), reported modified KGF proteins that had heparin binding 

15 activity even if 3, 8, or 27 N-terminal amino acid residues were missing. Accordingly, the 
present invention provides polypeptides having one or more residues deleted from the amino 
terminus of the amino acid sequence of the fi. burgdorferi polypeptides shown in Table 1 , and 
polynucleotides encoding such polypeptides. 

Similarly, many examples of biologically functional C-terminal deletion muteins are 

20 known. For instance, Interferon gamma shows up to ten times higher activities by deleting 8-10 
amino acid residues from the carboxy terminus of the protein See, e.g., Dobeli, et al. (1988) J. 
Biotechnology 7:199-216. Accordingly, the present invention provides polypeptides having one 
or more residues from the carboxy terminus of the amino acid sequence of the B, burgdorferi 
polypeptides shown in Table 1. The invention also provides polypeptides having one or more 

25 amino acids deleted from both the amino and the carboxyl termini as described below. 

The present invention is further directed to polynucleotide encoding portions or fragments 
of the amino acid sequences described herein as well as to portions or fragments of the isolated 
amino acid sequences described herein. Fragments include portions of the amino acid sequences 
of Table 1, are at least 5 contiguous amino acid in length, are selected from any two integers, one 

30 of which representing a N-terminal position. The initiation codon of the polypeptides of the 

present inventions position 1. Every combination of a N-terminal and C-terminal position that a 
fragment at least 5 contiguous amino acid residues in length could occupy, on any given amino 
acid sequence of Table 1 is included in the invention. At least means a fragment may be 5 
contiguous amino acid residues in length or any integer between 5 and the number of residues in a 

35 full length amino acid sequence minus 1 . Therefore, included in the invention are contiguous 
fragments specified by any N-terminal and C-terminal positions of amino acid sequence set forth 
in Table 1 wherein the contiguous fragment is any integer between 5 and the number of residues 
in a full length sequence minus 1 . 

Further, the invention includes polypeptides comprising fragments specified by size, in 
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amino acid residues, rather than by N-terminal and C-terminal positions. The invention includes 
any fragment size, in contiguous amino acid residues, selected from integers between 5 and the 
number of residues in a full length sequence minus 1 . Preferred sizes of contiguous polypeptide 
fragments include about 5 amino acid residues, about 10 amino acid residues, about 20 amino 
5 acid residues, about 30 amino acid residues, about 40 amino acid residues, about 50 amino acid 
residues, about 100 amino acid residues, about 200 amino acid residues, about 300 amino acid 
residues, and about 400 amino acid residues. The preferred sizes are, of course, meant to 
exemplify, not limit, the present invention as all size fragments representing any integer between 5 
and the number of residues in a full length sequence minus 1 are included in the invention. The 

10 present invention also provides for the exclusion of any fragments specified by N-terminal and C- 
terminal positions or by size in amino acid residues as described above. Any number of 
fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as 
described above may be excluded. 

The above fragments need not be active since they would be useful, for example, in 

15 immunoassays, in epitope mapping, epitope tagging, to generate antibodies to a particular portion 
of the protein, as vaccines, and as molecular weight markers. 

Other Mutants 

In addition to N- and C-terminal deletion forms of the protein discussed above, it also will 

20 be recognized by one of ordinary skill in the art that some amino acid sequences of the S. 

burgdorferi polypeptide can be varied without significant effect of the structure or function of the 
protein. If such differences in sequence are contemplated, it should be remembered that there will 
be critical areas on the protein which determine activity. 

Thus, the invention further includes variations of the R burgdorferi polypeptides which 

25 show substantial B. burgdorferi polypeptide activity or which include regions of B. burgdorferi 
protein such as the protein portions discussed below. Such mutants include deletions, insertions, 
inversions, repeats, and type substitutions selected according to general rules known in the art so 
as to have litde effect on activity. For example, guidance concerning how to make phenotypically 
silent amino acid substitutions is provided. There are two main approaches for studying the 

30 tolerance of an amino acid sequence to change. See^ Bowie, J. U. et aL (1990), Science 

247:1306-1310. The first method relies on the process of evolution, in which mutations are either 
accepted or rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections or screens to 
identify sequences that maintain functionality. 

35 These studies have revealed that proteins are surprisingly tolerant of amino acid 

substitutions. The studies indicate which amino acid changes are likely to be permissive at a 
certain position of the protein. For example, most buried amino acid residues require nonpolar 
side chains, whereas few features of surface side chains are generally conserved. Other such 
phenotypically silent substitutions are described by Bowie et al. {supra) and the references cited 
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therein. Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and lie; interchange of the hydroxyl residues Ser 
and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues 
Asn and Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic 
5 residues Phe, Tyr. 

Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 1, or that 
encoded by the plaimds listed in Table 1, may be: (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be one 

10 encoded by the genetic code: or (ii) one in which one or more of the amino acid residues includes 
a substituent group: or (iii) one in which the burgdorferi polypeptide is fused with another 
compound, such as a compound to increase the half-life of the polypeptide (for example, 
polyethylene glycol): or (iv) one in which the additional amino acids are fused to the above form 
of the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory sequence or a 

15 sequence which is employed for purification of the above form of the polypeptide or a proprotein 
sequence. Such fragments, derivatives and analogs are deemed to be within the scope of those 
skilled in the art from the teachings herein. 

Thus, the B. burgdorferi polypeptides of the present invention may include one or more 
amino acid substitutions, deletions, or additions, either from natural mutations or human 

20 manipulation. As indicated, changes are preferably of a minor nature, such as conservative amino 
acid substitutions that do not significantly affect the folding or activity of the protein (see Table 3). 

Amino acids in the 5. burgdorferi proteins of the present invention that are essential for 
function can be identified by methods known in the art, such as site-directed mutagenesis or 
alanine-scanning mutagenesis. See, e.g., Cunningham et al. (1989) Science 244:1081-1085. 

25 The latter procedure introduces single alanine mutations at every residue in the molecule. The 
resulting mutant molecules are then tested for biological activity using assays appropriate for 
measuring the function of the particular protein. 

Of special interest are substitutions of charged amino acids with other charged or neutral 
amino acids which may produce proteins with highly desirable improved characteristics, such as 

30 less aggregation. Aggregation may not only reduce activity but also be problematic when 
preparing pharmaceutical formulations, because aggregates can be immunogenic. See, e.g., 
Pinckard et al., (1967) Clin. Exp. Inmiunol. 2:331-340; Robbins, et al., (1987) Diabetes 36:838- 
845; Cleland, et al., (1993) Crit. Rev. Therapeutic Drug Carrier Systems 10:307-377. 

The polyjjeptides of the present invention are preferably provided in an isolated form, and 

35 preferably are substantially purified. A recombinantly produced version of the B. burgdorferi 

polypeptide can be substantially purified by the one-step method described by Smith et al. (1988) 
Gene 67:31-40. Polypeptides of the invention also can be purified from natural or recombinant 
sources using antibodies directed against the polypeptides of the invention in methods which are 
well known in the art of protein purification. 



wo 98/59071 PCT/US98/12718 

23 

The invention further provides for isolated B. burgdorferi polypeptides comprising an 
amino acid sequence selected from the group consisting of: (a) the amino acid sequence of a full- 
length B. burgdorferi polypeptide having the complete amino acid sequence shown in Table 1 ; 
(b) the amino acid sequence of a full-length B, burgdorferi polypeptide having the complete amino 
acid sequence shown in Table 1 excepting the N-terminal methionine; (c) the complete amino acid 
sequence encoded by the plaimds listed in Table 1 ; and (d) the complete amino acid sequence 
excepting the N-terminal methionine encoded by the plaimds listed in Table 1. The polypeptides 
of the present invention also include polypeptides having an amino acid sequence at least 80% 
identical, more preferably at least 90% identical, and still more preferably 95%, 96%, 97%, 98% 
or 99% identical to those described in (a), (b), (c), and (d) above. 

Further polypeptides of the present invention include polypeptides which have at least 
90% similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 
97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which comprises the amino 
acid sequence of a B. burgdorferi polypeptide having an amino acid sequence which contains at 
least one conservative amino acid substitution, but not more than 50 conservative amino acid 
substitutions, not more than 40 conservative amino acid substitutions, not more than 30 
conservative amino acid substitutions, and not more than 20 conservative amino acid 
substitutions. Also provided are polypeptides which comprise the amino acid sequence of a B. 
burgdorferi polypeptide, having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 
conservative amino acid substitutions. 

By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to 
a query amino acid sequence of the present invention, it is intended that the amino acid sequence 
of the subject polypeptide is identical to the query sequence except that the subject polypeptide 
sequence may include up to five amino acid alterations per each 100 amino acids of the query 
amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another amino acid. These 
alterations of the reference sequence may occur at the amino or carboxy terminal positions of the 
reference amino acid sequence or anywhere between those terminal positions, interspersed either 
individually among residues in the reference sequence or in one or more contiguous groups within 
the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 96%, 97%, 
98% or 99% identical to, for instance, the amino acid sequences shown in Table 1 or to the amino 
acid sequence encoded by the plaimds listed in Table 1 can be determined conventionally using 
known computer programs. A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence ahgnment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence ahgnment the 
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query and subject sequences are both amino acid sequences. The result of said global sequence 
alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment 
are: Matrix=PAM 0, k-tuple=2. Mismatch Penalty=l, Joining Penalty=20, Randomization Group 
Length=0, Cutoff Score=l, Window Size==sequence length. Gap Penalty=5, Gap Size 
5 Penalty=0.05, Window Si2e=500 or the length of the subject amino acid sequence, whichever is 
shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, the results, in percent identity, must be manually 
corrected. This is because the FASTDB program does not account for N- and C-terminal 

10 truncations of the subject sequence when calculating global percent identity. For subject 

sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is 
corrected by calculating the number of residues of the query sequence that are N- and C-terminal 
of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a 
percent of the total bases of the query sequence. Whether a residue is matched/aligned is 

15 determined by results of the FASTDB sequence alignment. This percentage is then subtracted 
from the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This final percent identity score is what is 
used for the purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are considered for the 

20 purposes of manually adjusting the percent identity score. That is, only query amino acid 
residues outside the farthest N- and C-terminal residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query 
sequence to determine percent identity. The deletion occurs at the N-terminus of the subject 
sequence and therefore, the FASTDB alignment does not match/align with the first 10 residues at 

25 the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at 
the N- and C- termini not matched/total number of residues in the query sequence) so 10% is 
subtracted from the percent identity score calculated by the FASTDB program. If the remaining 
90 residues were perfectly matched the final percent identity would be 90%. In another example, 
a 90 residue subject sequence is compared with a 100 residue query sequence. This time the 

30 deletions are internal so there are no residues at the N- or C-termini of the subject sequence which 
are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is 
not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of 
the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned 
with the query sequence are manually corrected. No other manual corrections are to made for the 

35 purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they have their 
normal biological activity. This is because even where a particular polypeptide molecule does not 
have biological activity, one of skill in the art would still know how to use the polypeptide, for 
instance, as a vaccine or to generate antibodies. Other uses of the polj^eptides of the present 
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invention that do not have B. burgdorferi activity include, inter alia, as epitope tags, in epitope 
mapping, and as molecular weight markers on SDS-PAGE gels or on molecular sieve gel 
filtration columns using methods known to those of skill in the art. 

As described below, the polypeptides of the present invention can also be used to raise 
5 polyclonal and monoclonal antibodies, which are useful in assays for detecting B, burgdorferi 
protein expression or as agonists and antagonists capable of enhancing or inhibiting B. 
burgdorferi protein function. Further, such polypeptides can be used in the yeast two-hybrid 
system to "capture" B. burgdorferi protein binding proteins which are also candidate agonists and 
antagonists according to the present invention. See, e.g.. Fields et al. (1989) Nature 
10 340:245-246. 

Epitope-B earing Portions 

In another aspect, the invention provides peptides and polypeptides comprising 
epitope-bearing portions of the B. burgdorferi polypeptides of the present invention. These 

15 epitopes are inimunogenic or antigenic epitopes of the polypeptides of the present invention. An 
"immunogenic epitope" is defined as a part of a protein that elicits an antibody response when the 
whole protein or polypeptide is the immunogen. These immunogenic epitopes are believed to be 
confined to a few loci on the molecule. On the other hand, a region of a protein molecule to 
which an antibody can bind is defined as an "antigenic determinant" or "antigenic epitope." The 

20 number of immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, e.g., Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002. 
Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 4 only lists 
amino acid residues comprising epitopes predicted to have the highest degree of antigenicity. The 
polypeptides not listed in Table 4 and portions of polypeptides not listed in Table 4 are not 

25 considered non-antigenic. This is because they may still be antigenic in vivo but merely not 

recognized as such by the particular algorithm used. Thus, Table 4 lists the amino acid residues 
comprising preferred antigenic epitopes but not a complete list. Amino acid residues comprising 
other anigenic epitopes may be determined by algorithms similar to the Jameson- Wolf analysis or 
by in vivo testing for an antigenic response using the methods described herein or those known in 

30 the art. 

As to the selection of peptides or polypeptides bearing an antigenic epitope (/.^., that 
contain a region of a protein molecule to which an antibody can bind), it is well known in that art 
that relatively short synthetic peptides that mimic part of a protein sequence are routinely capable 
of eliciting an antiserum that reacts with the partially mimicked protein. See, e.g., Sutcliffe, et 
35 al., (1983) Science 219:660-666. Peptides capable of eliciting protein-reactive sera are frequently 
represented in the primary sequence of a protein, can be characterized by a set of simple chemical 
rules, and are confined neither to immunodominant regions of intact proteins (i.e., inmiunogenic 
epitopes) nor to the amino or carboxyl terminals. Peptides that are extremely hydrophobic and 
those of six or fewer residues generally are ineffective at inducing antibodies that bind to the 
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mimicked protein; longer, peptides, especially those containing proline residues, usually are 
effective. See, Sutcliffe, et al,, supra, p. 661. For instance, 18 of 20 peptides designed 
according to these guidelines, containing 8-39 residues covering 75% of the sequence of the 
influenza virus hemagglutinin HAl polypeptide chain, induced antibodies that reacted with the 
5 HAl protein or intact virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the 
rabies glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful 
to raise antibodies, including monoclonal antibodies, that bind specifically to a polypeptide of the 
invention. Thus, a high proportion of hybridomas obtained by fusion of spleen cells from donors 

10 immunized with an antigen epitope-bearing peptide generally secrete antibody reactive with the 
native protein. See Sutcliffe, et al., ^wpra, p. 663. The antibodies raised by antigenic 
epitope-bearing peptides or polypeptides are useful to detect the mimicked protein, and antibodies 
to different peptides may be used for tracking the fate of various regions of a protein precursor 
which undergoes post-translational processing. The peptides and anti-peptide antibodies may be 

15 used in a variety of qualitative or quantitative assays for the mimicked protein, for instance in 
competition assays since it has been shown that even short peptides (e.g., about 9 amino acids) 
can bind and displace the larger peptides in immunoprecipitation assays. See, e.g., Wilson, et 
al., (1984) Cell 37:767-778. The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography using methods 

20 known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention designed according 
to the above guidelines preferably contain a sequence of at least seven, more preferably at least 
nine and most preferably between about 10 to about 50 amino acids (i.e. any integer between 7 
and 50) contained within the amino acid sequence of a polypeptide of the invention. However, 

25 peptides or polypeptides comprising a larger portion of an amino acid sequence of a polypeptide 
of the invention, containing about 50 to about 100 amino acids, or any length up to and including 
the entire amino acid sequence of a polypeptide of the invention, also are considered 
epitope-bearing peptides or polypeptides of the invention and also are useful for inducing 
antibodies that react with the mimicked protein. Preferably, the amino acid sequence of the 

30 epitope-bearing peptide is selected to provide substantial solubility in aqueous solvents (i.e., the 
sequence includes relatively hydrophilic residues and highly hydrophobic sequences are 
preferably avoided); and sequences containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used to generate 
an Borrelia-specific immune response or antibodies include portions of the amino acid sequences 

35 identified in Table 1. More specifically, Table 4 discloses a list of non-hmiting residues that are 
involved in the antigenicity of the epitope-bearing ft-agments of the present invention. Therefore, 
the present inventions provides for isolatd and purified antigenic epitope-bearing fragements of 
the polypeptides of the present invention comprising a peptide sequences of Table 4. The 
antigenic epitope-bearing fragments comprising a peptide sequence of Table 4 preferably contain a 
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sequence of at least seven, more preferably at least nine and most preferably between about 10 to 
about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide of the present 
invention. That is, included in the present invention are antigenic polypeptides between the 
integers of 7 and 50 amino acid in length comprising one or more of the sequences of Table 4, 
5 Therefore, in most cases, the polypeptides of Table 4 make up only a portion of the antigenic 

polypeptide. All combinations of sequences between the integers of 7 and 50 amino acid in length 
comprising one or more of the sequences of Table 4 are included. The antigenic epitope-bearing 
fragements may be specified by either the number of contiguous amino acid residues or by 
specific N-terminal and C-terminal positions as described above for the polypeptide fragements of 

10 the present invention, wherein the initiation codon is residue 1 . Any number of the described 
antigenic epitope-bearing fragements of the present invention may also be excluded from the 
present invention in the same manner. 

The epitope-bearing peptides and polypeptides of the invention may be produced by any 
conventional means for making peptides or polypeptides including recombinant means using 

15 nucleic acid molecules of the invention. For instance, an epitope-bearing amino acid sequence of 
the present invention may be fused to a larger polypeptide which acts as a carrier during 
recombinant production and purification, as well as during immunization to produce anti-peptide 
antibodies. Epitope-bearing peptides also may be synthesized using known methods of chemical 
synthesis. For instance, Houghten has described a simple method for synthesis of large numbers 

20 of peptides, such as 10-20 mg of 248 different 13 residue peptides representing single amino acid 
variants of a segment of the HAl polypeptide which were prepared and characterized (by 
ELISA-type binding studies) in less than four weeks (Houghten, R. A. Proc. Natl. Acad. Sci. 
USA 82:5131-5135 (1985)). This "Simultaneous Multiple Peptide Synthesis (SMPS)" process 
is further described in U.S. Patent No. 4,631,21 1 to Houghten and coworkers (1986). In this 

25 procedure the individual resins for the solid-phase synthesis of various peptides are contained in 
separate solvent-permeable packets, enabling the optimal use of the many identical repetitive steps 
involved in solid-phase methods. A completely manual procedure allows 500-l(X)0 or more 
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. Sci. 
82:5131-5135 at 5134. 

30 Epitope-bearing peptides and polypeptides of the invention are used to induce antibodies 

according to methods well known in the art. See, e.g., Sutcliffe, et al., supra;; Wilson, et al., 
supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354. Generally, animals may be 
immunized with free peptide; however, anti-peptide antibody titer may be boosted by coupling of 
the peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus 

35 toxoid. For instance, peptides containing cysteine may be coupled to carrier using a linker such 
as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be 
coupled to carrier using a more general linking agent such as glutaraldehyde. Animals such as 
rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for instance, by 
intraperitoneal and/or intradermal injection of emulsions containing about 100 (Xg peptide or 
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carrier protein and Freund's adjuvant. Several booster injections may be needed, for instance, at 
intervals of about two weeks, to provide a useful titer of anti-peptide antibody which can be 
detected, for example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by selection of 
anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of 
the selected antibodies according to methods well known in the art. 

Immunogenic epi tope-bearing peptides of the invention, i.e., those parts of a protein that 
elicit an antibody response when the whole protein is the immunogen, are identified according to 
methods known in the art. For instance, Geysen, et al, supra, discloses a procedure for rapid 
concurrent synthesis on solid supports of hundreds of peptides of sufficient purity to react in an 
ELISA. Interaction of synthesized peptides with antibodies is then easily detected without 
removing them from the support. In this manner a peptide bearing an immunogenic epitope of a 
desired protein may be identified routinely by one of ordinary skill in the art. For instance, the 
immunologically important epitope in the coat protein of foot-and-mouth disease virus was located 
by Geysen et al supra with a resolution of seven amino acids by synthesis of an overlapping set 
of all 208 possible hexapeptides covering the entire 213 amino acid sequence of the protein. 
Then, a complete replacement set of peptides in which all 20 amino acids were substituted in tum 
at every position within the epitope were synthesized, and the particular amino acids conferring 
specificity for the reaction with antibody were determined. Thus, peptide analogs of the 
epitope-bearing peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Geysen (1987) further describes this method of identifying a peptide bearing an 
immunogenic epitope of a desired protein. 

Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general method 
of detecting or determining the sequence of monomers (amino acids or other compounds) which 
is a topological equivalent of the epitope (i.e., a "mimotope") which is complementary to a 
particular paratope (antigen binding site) of an antibody of interest. More generally, U.S. Patent 
No. 4,433,092, also to Geysen (1989), describes a method of detecting or determining a 
sequence of monomers which is a topographical equivalent of a ligand which is complementary to 
the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 
to Houghten, R. A. et al (1996) discloses linear Cj-C^-alkyl peralkylated oligopeptides and sets 
and libraries of such peptides, as well as methods for using such oligopeptide sets and libraries 
for determining the sequence of a peralkylated oligopeptide that preferentially binds to an acceptor 
molecule of interest. Thus, non-peptide analogs of the epitope-bearing peptides of the invention 
also can be made routinely by these methods. The entire disclosure of each document cited in this 
section on "Polypeptides and Fragments" is hereby incorporated herein by reference. 

As one of skill in the art will appreciate, the polypeptides of the present invention and the 
epitope-bearing fragments thereof described above can be combined with parts of the constant 
domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. This has been shown, e,g., for 
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chimeric proteins consisting of the first two domains of the human CI>4-polypeptide and various 
domains of the constant regions of the heavy or light chains of mammalian immunoglobulins. 
(EPA 0,394,827; Traunecker et al. (1988) Nature 331:84-86. Fusion proteins that have a 
disulfide-linked dimeric structure due to the IgG part can also be more efficient in binding and 
5 neutralizing other molecules than a monomeric B, burgdorferi polypeptide or fragment thereof 
alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964. Nucleic acids encoding the 
above epitopes of B. burgdorferi polypeptides can also be recombined with a gene of interest as 
an epitope tag to aid in detection and purification of the expressed polypeptide. 

10 Antibodies 

B, burgdorferi protein-specific antibodies for use in the present invention can be raised 
against the intact jB. burgdorferi protein or an antigenic polypeptide fragment thereof, which may 
be presented together with a carrier protein, such as an albumin, to an animal system (such as 
rabbit or mouse) or, if it is long enough (at least about 25 amino acids), without a carrier. 

15 As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to 

include intact molecules, single chain whole antibodies, and antibody fragments. Antibody 
fragments of the present invention include Fab and F(ab')2 and other fragments including single- 
chain Fvs (scFv) and disulfide-linked Fvs (sdFv). Also included in the present invention are 
chimeric and humanized monoclonal antibodies and polyclonal antibodies specific for the 

20 polypeptides of the present invention. The antibodies of the present invention may be prepared by 
any of a variety of methods. For example, cells expressing a polypeptide of the present invention 
or an antigenic fragment thereof can be administered to an animal in order to induce the production 
of sera containing polyclonal antibodies. For example, a preparation of B, burgdorferi 
polypeptide or fragment thereof is prepared and purified to render it substantially free of natural 

25 contaminants. Such a preparation is then introduced into an animal in order to produce polyclonal 
antisera of greater specific activity. 

In a preferred method, the antibodies of the present invention are monoclonal antibodies or 
binding fragments thereof Such monoclonal antibodies can be prepared using hybridoma 
technology. See, e.g., Harlow et al., ANTIBODffiS: A LABORATORY MANUAL, (Cold 

30 Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et al., in: MONOCLONAL 
ANTIBODffiS AND T-CELL HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and 
F(ab')2 fragments may be produced by proteolytic cleavage, using enzymes such as papain (to 
produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, B. burgdorferi 
polypeptide-binding fragments, chimeric, and humanized antibodies can be produced through the 

35 application of recombinant DNA technology or through synthetic chemistry using methods known 
in the art. 

Altematively, additional antibodies capable of binding to the polypeptide antigen of the 
present invention may be produced in a two-step procedure through the use of anti-idiotypic 
antibodies. Such a method makes use of the fact that antibodies are themselves antigens, and that. 
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therefore, it is possible to obtain an antibody which binds to a second antibody. In accordance 
with this method, B, burgdorferi polypeptide-specific antibodies are used to immunize an animal, 
preferably a mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody whose ability 
to bind to the B. burgdorferi polypeptide-specific antibody can be blocked by the B. burgdorferi 
polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to the B. burgdorferi 
polypeptide-specific antibody and can be used to immunize an animal to induce formation of 
further B. burgdorferi polypeptide-specific antibodies. 

Antibodies and fragements thereof of the present invention may be described by the 
portion of a polypeptide of the present invention recognized or specifically bound by the antibody. 
Antibody binding fragements of a polypeptide of the present invention may be described or 
specified in the same manner as for polypeptide fragements discussed above., i.e, by N-terminal 
and C-terminal positions or by size in contiguous amino acid residues. Any number of antibody 
binding fragments, of a polypeptide of the present invention, specified by N-terminal and C- 
terminal positions or by size in amino acid residues, as described above, may also be excluded 
from the present invention. Therefore, the present invention includes antibodies the specifically 
bind a particuarlly discribed fragement of a polypeptide of the present invention and allows for the 
exclusion of the same. 

Antibodies and fragements thereof of the present invention may also be described or 
specified in terms of their cross-reactivity. Antibodies and fragements that do not bind 
polypeptides of any other species of Borrelia other than B. burgdorferi are included in the present 
invention. Likewise, antibodies and fragements that bind only species of Borrelia^ i.e. antibodies 
and fragements that do not bind bacteria from any genus other than Borrelia, are included in the 
present invention. 



Diagnostic Assays 

The present invention fiirther relates to methods for assaying staphylococcal infection in 
an animal by detecting the expression of genes encoding staphylococcal polypeptides of the 
present invention. The methods comprise analyzing tissue or body fluid from the animal for 
fiorre/ia-specific antibodies, nucleic acids, or proteins. Analysis of nucleic acid specific to 
Borrelia is assayed by PGR or hybridization techniques using nucleic acid sequences of the 
present invention as either hybridization probes or primers. See, e.g., Sambrook et al. Molecular 
cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 
reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing differentiation 
among spotted fever group Rickettsiae species by analysis of restriction fragment length 
polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin. Microbiol. 32:589-595 
(detecting B. burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Borrelia has already been 
made, the present invention is useful for monitoring progression or regression of the disease state 
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whereby patients exhibiting enhanced Borrelia gene expression will experience a worse clinical 
outcome relative to patients expressing these gene(s) at a lower level. 

By "biological sample" is intended any biological sample obtained from an animal, cell 
line, tissue culture, or other source which contains Borrelia polypeptide, mRNA, or DNA. 
5 Biological samples include body fluids (such as saliva, blood, plasma, urine, mucus, synovial 
fluid, etc.) tissues (such as muscle, skin, and cartilage) zmd any other biological source suspected 
of containing Borrelia polypeptides or nucleic acids. Methods for obtaining biological samples 
such as tissue are well known in the art. 

The present invention is useful for detecting diseases related to Borrelia infections in 

10 animals. Preferred animals include monkeys, apes, cats, dogs, birds, cows, pigs, mice, horses, 
rabbits and humans. Particularly preferred are humans. 

Total RNA can be isolated from a biological sample using any suitable technique such as 
the single-step guanidinium-thiocyanate-phenol-chloroform method described in Chomczynski et 
al. (1987) Anal. Biochem. 162:156-159. mRNA encoding Borrelia polypeptides having sufficient 

15 homology to the nucleic acid sequences identified in Table 1 to allow for hybridization between 
complementary sequences are then assayed using any appropriate method. These include 
Northern blot analysis, SI nuclease mapping, the polymerase chain reaction (PGR), reverse 
transcription in combination with the polymerase chain reaction (RT-PCR), and reverse 
transcription in combination with the ligase chain reaction (RT-LCR). 

20 Northem blot analysis can be performed as described in Harada et al. (1990) Cell 

63:303-312. Briefly, total RNA is prepared from a biological sample as described above. For the 
Northem blot, the RNA is denatured in an appropriate buffer (such as glyoxal/dimethyl 
sulfoxide/sodium phosphate buffer), subjected to agarose gel electrophoresis, and transferred 
onto a nitrocellulose filter. After the RN As have been linked to the filter by a UV linker, the filter 

25 is prehybridized in a solution containing formamide, SSC, Denhardt's solution, denatured salmon 
sperm, SDS, and sodium phosphate buffer. A B, burgdorferi polynucleotide sequence shown in 
Table 1 labeled according to any appropriate method (such as the ^^P-multiprimed DNA labehng 
system (Amersham)) is used as probe. After hybridization overnight, the filter is washed and 
exposed to x-ray film. DNA for use as probe according to the present invention is described in 

30 the sections above and will preferably at least 15 nucleotides in length. 

SI mapping can be performed as described in Fujita et al. (1987) Cell 49:357-367. To 
prepare probe DNA for use in S 1 mapping, the sense strand of an above-described B, burgdorferi 
DNA sequence of the present invention is used as a template to synthesize labeled antisense DNA. 
The antisense DNA can then be digested using an appropriate restriction endonuclease to generate 

35 further DNA probes of a desired length. Such antisense probes are useful for visualizing 

protected bands corresponding to the target mRNA (/.e., mRNA encoding Borrelia polypeptides). 

Levels of mRNA encoding Borrelia polypeptides are assayed, for e.g., using the RT-PCR 
method described in Makino et al. (1990) Technique 2:295-301. By this method, the 
radioactivities of the "amplicons" in the polyacrylamide gel bands are linearly related to the initial 
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concentration of the target mRNA. Briefly, this method involves adding total RNA isolated from 
a biological sample in a reaction mixture containing a RT primer and appropriate buffer. After 
incubating for primer annealing, the mixture can be supplemented with a RT buffer, dNTPs, 
DTT, RNase inhibitor and reverse transcriptase. After incubation to achieve reverse transcription 
5 of the RNA, the RT products are then subject to PGR using labeled primers. Alternatively, rather 
than labeling the primers, a labeled dNTP can be included in the PGR reaction mixture. PGR 
amplification can be performed in a DNA thermal cycler according to conventional techniques. 
After a suitable number of rounds to achieve amplification, the PGR reaction mixture is 
electrophoresed on a polyacrylamide gel. After drying the gel, the radioactivity of the appropriate 

10 bands (corresponding to the mRNA encoding the Borrelia polypeptides of the present invention) 
are quantified using an imaging analyzer. RT and PGR reaction ingredients and conditions, 
reagent and gel concentrations, and labeling methods are well known in the art. Variations on the 
RT-PGR method will be apparent to the skilled artisan. Other PGR methods that can detect the 
nucleic acid of the present invention can be found in PGR PRIMER: A LABORATORY 

15 MANUAL (G.W. Dieffenbach et al. eds.. Gold Spring Harbor Lab Press, 1995). 

The polynucleotides of the present invention, including both DNA and RNA, may be used 
to detect polynucleotides of the present invention or Borrelia species including B. burgdorferi 
using bio chip technology. The present invention includes both high density chip arrays (>1000 
oligonucleotides per cm^) and low density chip arrays (<1000 oligonucleotides per cm^). Bio 

20 chips comprising arrays of polynucleotides of the present invention may be used to detect Borrelia 
species, including B. burgdorferi, in biological and environmental samples and to diagnose an 
animal, including humans, with an B, burgdorferi or other Borrelia infection. The bio chips of 
the present invention may comprise polynucleotide sequences of other pathogens including 
bacteria, viral, parasitic, and fungal polynucleotide sequences, in addition to the polynucleotide 

25 sequences of the present invention, for use in rapid diffenertial pathogenic detection and 
diagnosis. The bio chips can also be used to monitor an B. burgdorferi or other Borrelia 
infections and to monitor the genetic changes (deletions, insertions, mismatches, etc.) in response 
to drug therapy in the clinic and drug development in the laboratory. The bio chip technology 
comprising arrays of polynucleotides of the present invention may also be used to simultaneously 

30 monitor the expression of a multiplicity of genes, including those of the present invention. The 
polynucleotides used to comprise a selected array may be specified in the same manner as for the 
fragements, i.e, by their 5' and 3' positions or length in contigious base pairs and include from. 
Methods and particular uses of the polynucleotides of the present invention to detect Borrelia 
species, including B. burgdorferi, using bio chip technology include those known in the art and 

35 those of: U.S. Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 
5527681, 5451683, 5424186, 5607646, 5658732 and Worid Patent Nos. WO/9710365, 
WO/951 1995, WO/9743447, WO/9535505, each incorporated herein in their entireties. 

Biosensors using the polynucleotides of the present invention may also be used to detect, 
diagnose, and monitor B, burgdorferi or other BorreUa species and infections thereof. 
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Biosensors using the polynucleotides of the present invention may also be used to detect particular 
polynucleotides of the present invention. Biosensors using the polynucleotides of the present 
invention may also be used to monitor the genetic changes (deletions, insertions, mismatches, 
etc.) in response to drug therapy in the clinic and drug development in the laboratory. Methods 
5 and particular uses of the polynucleotides of the present invention to detect Borrelia species, 
including B, burgdorferU using biosenors include those known in the art and those of: U.S. 
Patent Nos 5721 102, 5658732, 5631 170, and World Patent Nos. WO97/3501 1, WO/9720203, 
each incorporated herein in their entireties. 

Thus, the present invention includes both bio chips and biosensors comprising 

10 polynucleotides of the present invention and methods of their use. 

Assaying Borrelia polypeptide levels in a biological sample can occur using any art-known 
method, such as antibody-based techniques. For example, Borrelia polypeptide expression in 
tissues can be studied with classical immunohistological methods. In these, the specific 
recognition is provided by the primary antibody (polyclonal or monoclonal) but the secondary 

15 detection system can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a 
result, an immunohistological staining of tissue section for pathological examination is obtained. 
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of Borrelia 
polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. et al. (1985) J. Cell. 
Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell . BioL 105:3087-3096. In this technique, 

20 which is based on the use of cationic solid phases, quantitation of a Borrelia polypeptide can be 
accomplished using an isolated Borrelia polypeptide as a standard. This technique can also be 
applied to body fluids. 

Other antibody-based methods useful for detecting Borrelia polypeptide gene expression 
include immunoassays, such as the ELISA and the radioimmunoassay (RIA). For example, a 

25 Borrelia polypeptide-specific monoclonal antibodies can be used both as an immunoabsorbent and 
as an enzyme-labeled probe to detect and quantify a Borrelia polypeptide. The amount of a 
Borrelia polypeptide present in the sample can be calculated by reference to the amount present in 
a standard preparation using a linear regression computer algorithm. Such an ELISA is described 
in lacobelli et al. (1988) Breast Cancer Research and Treatment 1 1: 19-30. In another ELISA 

30 assay, two distinct specific monoclonal antibodies can be used to detect Borrelia polypeptides in a 
body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and the other as 
the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or "two-step" assay. 
The "one-step" assay involves contacting the Borrelia polypeptide with immobilized antibody and. 

35 without washing, contacting the mixture with the labeled antibody. The "two-step" assay 

involves washing before contacting the mixture with the labeled antibody. Other conventional 
methods may also be employed as suitable. It is usually desirable to inmiobilize one component 
of the assay system on a support, thereby allowing other components of the system to be brought 
into contact with the component and readily removed from the sample. Variations of the above 
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and other immunological methods included in the present invention can also be found in Harlow 
et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 
2nded. 1988). 

Suitable enzyme labels include, for example, those from the oxidase group, which 
catalyze the production of hydrogen peroxide by reacting with substrate. Glucose oxidase is 
particularly preferred as it has good stability and its substrate (glucose) is readily available. 
Activity of an oxidase label may be assayed by measuring the concentration of hydrogen peroxide 
formed by the enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable labels 
include radioisotopes, such as iodine (*^^I, *^'I), carbon ('^C), sulphur (^^S), tritium (^H), indium 
("^In), and technetium (^"Tc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

Further suitable labels for the Borrelia polypeptide-specific antibodies of the present 
invention are provided below. Examples of suitable enzyme labels include malate dehydrogenase, 
Borrelia nuclease, delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol 
phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, 
asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase. 

Examples of suitable radioisotopic labels include ^H, *^*In, ^'^I, ''P, ''S, '^C, ''Cr, 
"'To, '«Co, '"Fe, ''Se, '"Eu, ""Y, ''Cu, ^'^Ci, '*^At, ^'^Pb, "'Sc, ^^^Pd, etc. '**In is a preferred 
isotope where in vivo imaging is used since its avoids the problem of dehalogenation of the '^I or 
'^'l-labeled monoclonal antibody by the liver. In addition, this radionucleotide has a more 
favorable gamma emission energy for imaging. See, e.g., Perkins et al. (1985) Eur. J. Nucl. 
Med. 10:296-301; Carasquillo et al. (1987) J. Nucl. Med. 28:281-287. For example, ''*In 
coupled to monoclonal antibodies with l-(P-isothiocyanatobenzyl)-DPTA has shown little uptake 
in non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 
locaHzation. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870. 

Examples of suitable non-radioactive isotopic labels include '^^Gd, ^^Mn, '^"Dy, ^^Tr, and 

^^Fe. 

Examples of suitable fluorescent labels include an *^^Eu label, a fluorescein label, an 
isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an 
allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label. 

Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, ricin, and 
cholera toxin. 

Examples of chemiluminescent labels include a luminal label, an isoluminal label, an 
aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate ester label, 
a luciferin label, a luciferase label, and an aequorin label. 

Examples of nuclear magnetic resonance contrasting agents include heavy metal nuclei 
such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are provided by 
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Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and Schurs et al. (1977) Clin. Chim. Acta 
8 1 : 1-40. Coupling techniques mentioned in the latter are the glutaraldehyde method, the periodate 
method, the dimaleimide method, the m-maleimidobenzyl-N-hydroxy-succinimide ester method, 
all of which methods are incorporated by reference herein. 

In a related aspect, the invention includes a diagnostic kit for use in screening serum 
containing antibodies specific against B. burgdorferi infection. Such a kit may include an 
isolated B, burgdorferi antigen comprising an epitope which is specifically immunoreactive with 
at least one anti-R burgdorferi antibody. Such a kit also includes means for detecting the 
binding of said antibody to the antigen. In specific embodiments, the kit may include a 
recombinandy produced or chemically synthesized peptide or polypeptide antigen. The peptide or 
polypeptide antigen may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above-described kit includes a 
solid support to which said peptide or polypeptide antigen is attached. Such a kit may also 
include a non-attached reporter-labeled anti-human antibody. In this embodiment, binding of the 
antibody to the B. burgdorferi antigen can be detected by binding of the reporter labeled antibody 
to the anti-fi. burgdorferi polypeptide antibody. 

In a related aspect, the invention includes a method of detecting B, burgdorferi infection in 
a subject. This detection method includes reacting a body fluid, preferably serum, from the 
subject with an isolated B. burgdorferi antigen, and examining the antigen for the presence of 
bound antibody. In a specific embodiment, the method includes a polypeptide antigen attached to 
a solid support, and serum is reacted with the support. Subsequently, the support is reacted with 
a reporter-labeled anti-human antibody. The support is then examined for the presence of 
reporter-labeled antibody. 

The solid surface reagent employed in the above assays and kits is prepared by known 
techniques for attaching protein material to solid support material, such as polymeric beads, dip 
sticks, 96-well plates or filter material. These attachment methods generally include non-specific 
adsorption of the protein to the support or covalent attachment of the protein , typically through a 
free amine group, to a chemically reactive group on the solid support, such as an activated 
carboxyl, hydroxyl, or aldehyde group. Altematively, streptavidin coated plates can be used in 
conjunction with biotinylated antigen(s). 

The polypeptides and antibodies of the present invention, including fragments thereof, 
may be used to detect Borrelia species including B. burgdorferi using bio chip and biosensor 
technology. Bio chip and biosensors of the present invention may comprise the polypeptides of 
the present invention to detect antibodies, which specifically recognize Borreha species, including 
B, burgdorferi. Bio chip and biosensors of the present invention may also comprise antibodies 
which specifically recognize the polypeptides of the present invention to detect Borrelia species, 
including B, burgdorferi or specific polypeptides of the present invention. Bio chips or 
biosensors comprising polypeptides or antibodies of the present invention may be used to detect 
Borreha species, including B, burgdorferi, in biological and environmental samples and to 
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diagnose an animal, including humans, with an B, burgdorferi or other Borrelia infection. Thus, 
the present invention includes both bio chips and biosensors comprising polypeptides or 
antibodies of the present invention and methods of their use. 

The bio chips of the present invention may further comprise polypeptide sequences of 
other pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition 
to the polypeptide sequences of the present invention, for use in rapid diffenertial pathogenic 
detection and diagnosis. The bio chips of the present invention may further comprise antibodies 
or fragements thereof specific for other pathogens including bacteria, viral, parasitic, and fungal 
polypeptide sequences, in addition to the antibodies or fragements thereof of the present 
invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and 
biosensors of the present invention may also be used to monitor an B. burgdorferi or other 
Borrelia infection and to monitor the genetic changes (amio acid deletions, insertions, 
substitutions, etc.) in response to drug therapy in the clinic and drug development in the 
laboratory. The bio chip and biosensors comprising polypeptides or antibodies of the present 
invention may also be used to simultaneously monitor the expression of a multiplicity of 
polypeptides, including those of the present invention. The polypeptides used to comprise a bio 
chip or biosensor of the present invention may be specified in the same manner as for the 
fragements, i.e, by their N-terminal and C-terminal positions or length in contigious amino acid 
residue. Methods and particular uses of the polypeptides and antibodies of the present invention 
to detect Borrelia species, including B. burgdorferi, or specific polypeptides using bio chip and 
biosensor technology include those known in the art, those of the U.S. Patent Nos. and World 
Patent Nos. listed above for bio chips and biosensors using polynucleotides of the present 
invention, and those of: U.S. Patent Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and 
World Patent Nos. W09729366, W096 12957, each incorporated herein in their entireties. 

Treatment: 

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compounds to identify those which 
enhance or block the biological activity of the B. burgdorferi polypeptides of the present 
invention. The present invention further provides where the compounds kill or slow the growth 
of B, burgdorferi. The ability of B. burgdorferi antagonists, including B. burgdorferi ligands, to 
prophylactically or therapeutically block antibiotic resistance may be easily tested by the skilled 
artisan. See, e.g., Straden et al. (1997) J Bacteriol. 179(1):9-16. 

An agonist is a compound which increases the natural biological function or which 
functions in a manner similar to the polypeptides of the present invention, while antagonists 
decrease or eliminate such functions. Potential antagonists include small organic molecules, 
peptides, polypeptides, and antibodies that bind to a polypeptide of the invention and thereby 
inhibit or extinguish its activity. 

The antagonists may be employed for instance to inhibit peptidoglycan cross bridge 
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formation. Antibodies against 5. burgdorferi may be employed to bind to and inhibit B, 
burgdorferi activity to treat antibiotic resistance. Any of the above antagonists may be employed 
in a composition with a pharmaceutically acceptable carrier. 



5 Vaccines 

The present invention also provides vaccines comprising one or more polypeptides of the 
present invention. Heterogeneity in the composition of a vaccine may be provided by combining 
B. burgdorferi polypeptides of the present invention. Multi-component vaccines of this type are 
desirable because they are likely to be more effective in eliciting protective immune responses 

10 against multiple species and strains of the Borrelia genus than single polypeptide vaccines. Thus, 
as discussed in detail below, a multi-component vaccine of the present invention may contain one 
or more, preferably 2 to about 20, more preferably 2 to about 15, and most preferably 3 to about 
8, of the S. burgdorferi polypeptides shown in Table 1, or fragments thereof. 

Multi-component vaccines are known in the art to elicit antibody production to numerous 

15 immunogenic components. Decker, M. and Edwards, K., J, Infect, Dis, 174:8270-275 (1996). 
In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent vaccine has recently been 
demonstrated to elicit protective levels of antibodies in human infants against all four pathogenic 
agents. Aristegui, J. etaL, Vaccine 75:7-9 (1997). 

The present invention thus also includes multi-component vaccines. These vaccines 

20 comprise more than one polypeptide, immunogen or antigen. An example of such a multi- 
component vaccine would be a vaccine comprising more than one of the fl. burgdorferi 
polypeptides shown in Table 1 . A second example is a vaccine comprising one or more, for 
example 2 to 10, of the B. burgdorferi polypeptides shown in Table 1 and one or more, for 
example 2 to 10, additional polypeptides of either borrelial or non-borrelial origin. Thus, a multi- 

25 component vaccine which confers protective immunity to both a borrelial infection and infection 
by another pathogenic agent is also within the scope of the invention. 

As indicated above, the vaccines of the present invention are expected to elicit a protective 
immune response against infections caused by species and strains of Borrelia other than B, 
burgdorferi sensu stricto isolate B31 (ATCC Accession No. 35210). Immunizations using 

30 decorin-binding protein and OspA derived from one strain of B. burgdorferi has been shown to 
elicit the production of antiserum which confers passive immunity against other strains of fi. 
burgdorferi, Cassatt, D. et al. Protection of Borrelia burgdorferi Infection by Antibodies to 
Decorin-binding Protein^ in Vaccines97, Cold Spring Harbor Press (1997), pages 191-195. 
Further, the inventors have found using an in vitro assay that antiserum produced in response to 

35 B, burgdorferi decorin-binding protein will kill several species of Borrelia, The amino acid 
sequences of decorin-binding protein expressed by different strains of B, burgdorferi are 
believed to diverge by as much as 25%. Thus, antisera elicited against decorin-binding proteins 
confers passive immunity against Borrelia expressing proteins having only 75% or less amino 
acid sequence similarity. 
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Further within the scope of the invention are whole cell and whole viral vaccines. Such 
vaccines may be produced recombinantly and involve the expression of one or more of the 
B, burgdorferi polypeptides shown in Table 1. For example, the B, burgdorferi polypeptides of 
the present invention may be either secreted or localized intracellular, on the cell surface, or in the 
periplasmic space. Further, when a recombinant virus is used, the B, burgdorferi polypeptides 
of the present invention may, for example, be localized in the viral envelope, on the surface of the 
capsid, or internally within the capsid. Whole cells vaccines which employ cells expressing 
heterologous proteins are known in the art. See, e.g., Robinson, K. et al. Nature Biotech. 
75:653-657 (1997); Sirard, J. et al, Infect. Immun. 65:2029-2033 (1997); Chabalgoity, J. et ai, 
Infect. Immun. 65:2402-2412 (1997). These cells may be administered live or may be killed 
prior to administration. Chabalgoity, J. et ai, supra^ for example, report the successful use in 
mice of a live attenuated Salmonella vaccine strain which expresses a portion of a platyhelminth 
fatty acid-binding protein as a fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques known in the art by 
combining one or more B. burgdorferi polypeptides of the present invention, or fragments 
thereof, with additional non-borrelial components {e.g., diphtheria toxin or tetanus toxin, and/or 
other compounds known to elicit an immune response). Such vaccines are useful for eliciting 
protective inmiune responses to both members of the Borrelia genus and non-borrelial pathogenic 
agents. 

The vaccines of the present invention also include DNA vaccines. DNA vaccines are 
currently being developed for a number of infectious diseases. Boyer, J et ai, Nat. Med. 5:526- 
532 (1997); reviewed in Spier, R., Vaccine 14: 1285-1288 (1996). Such DNA vaccines contain a 
nucleotide sequence encoding one or more B. burgdorferi polypeptides of the present invention 
oriented in a manner that allows for expression of the subject polypeptide. The direct 
administration of plasmid DNA encoding OspA has been shown to elicit protective immunity in 
mice against borrelial challenge. Luke, C. etal, J. Infect. Dis. 775:91-97 (1997). 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim, J. et ai, Nature 
Biotech. 75:641-646 (1997), for example, report the enhancement of immune responses produced 
by DNA immunizations when DNA sequences encoding molecules which stimulate the immune 
response are co-administered. In a similar fashion, the vaccines of the present invention may be 
co-administered with either nucleic acids encoding immune modulators or the immune modulators 
themselves. These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

The vaccines of the present invention may be used to confer resistance to borrelial 
infection by either passive or active inmiunization. When the vaccines of the present invention are 
used to confer resistance to borrelial infection through active immunization, a vaccine of the 
present invention is administered to an animal to elicit a protective immune response which either 
prevents or attenuates a borrelial infection. When the vaccines of the present invention are used to 
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confer resistance to borrelial infection through passive inununization, the vaccine is provided to a 
host animal (e.g., human, dog, or mouse), and the antisera elicited by this antisera is recovered 
land directiy provided to a recipient suspected of having an infection caused by a member of the 
Borrelia genus. 

The abihty to label antibodies, or fragments of antibodies, with toxin molecules provides 
an additional method for treating borrelial infections when passive immunization is conducted. In 
this embodiment, antibodies, or fragments of antibodies, capable of recognizing the 
B. burgdorferi polypeptides disclosed herein, or fragments thereof, as well as other Borrelia 
proteins, are labeled with toxin molecules prior to their administration to the patient. When such 
toxin derivatized antibodies bind to Borrelia cells, toxin moieties will be localized to these cells 
and will cause their death. 

The present invention thus concerns and provides a means for preventing or attenuating a 
borrelial infection resulting from organisms which have antigens that are recognized and bound by 
antisera produced in response to the polypeptides of the present invention. As used herein, a 
vaccine is said to prevent or attenuate a disease if its administration to an animal results either in 
the total or partial attenuation {i.e., suppression) of a symptom or condition of the disease, or in 
the total or partial immunity of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for either a 
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the compound(s) are 
provided in advance of any symptoms of borrelial infection. The prophylactic administration of 
the compound(s) serves to prevent or attenuate any subsequent infection. When provided 
therapeutically, the compound(s) is provided upon or after the detection of symptoms which 
indicate that an animal may be infected with a member of the Borrelia genus. The therapeutic 
administration of the compound(s) serves to attenuate any actual infection. Thus, the 
B. burgdorferi polypeptides, and fragments thereof, of the present invention may be provided 
either prior to the onset of infection (so as to prevent or attenuate an anticipated infection) or after 
the initiation of an actual infection. 

The polypeptides of the invention, whether encoding a portion of a native protein or a 
functional derivative thereof, may be administered in pure form or may be coupled to a 
macromolecular carrier. Example of such carriers are proteins and carbohydrates. Suitable 
proteins which may act as macromolecular carrier for enhancing the immunogenicity of the 
polypeptides of the present invention include keyhole limpet hemacyanin (KLH) tetanus toxoid, 
pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupling the polypeptides of 
the present invention to such macromolecular carriers are disclosed in Harlow et al.. Antibodies: 
A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1988), the entire disclosure of which is incorporated by reference herein. 

A composition is said to be "pharmacologically acceptable" if its administration can be 
tolerated by a recipient animal and is otherwise suitable for administration to that animal. Such an 
agent is said to be administered in a "therapeutically effective amount" if the amount administered 
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is physiologically significant. An agent is physiologically significant if its presence results in a 
detectable change in the physiology of a recipient patient. 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that the 
composition of a pharmacologically acceptable compound varies with the animal to which it is 
administered. For example, a vaccine intended for human use will generally not be co- 
administered with Freund's adjuvant. Further, the level of purity of the B. burgdorferi 
polypeptides of the present invention will normally be higher when administered to a human than 
when administered to a non-human animal. 

As would be understood by one of ordinary skill in the art, when the vaccine of the 
present invention is provided to an animal, it may be in a composition which may contain salts, 
buffers, adjuvants, or other substances which are desirable for improving the efficacy of the 
composition. Adjuvants are substances that can be used to specifically augment a specific 
immune response. These substances generally perform two functions: (1) they protect the 
antigen(s) from being rapidly catabolized after administration and (2) they nonspecifically 
stimulate immune responses. 

Normally, the adjuvant and the composition are mixed prior to presentation to the immune 
system, or presented separately, but into the same site of the animal being immunized. Adjuvants 
can be loosely divided into several groups based upon their composition. These groups include 
oil adjuvants (for example, Freund's complete and incomplete), mineral salts (for example, 
AIKCSOJ^, AlNa(S04)2, AINH^CSOJ, silica, kaolin, and carbon), polynucleotides (for example, 
poly IC and poly AU acids), and certain natural substances (for example, wax D from 
Mycobacterium tuberculosis, as well as substances found in Corynebacterium parvum, or 
Bordetella pertussis, and members of the genus Brucella, Other substances useful as adjuvants 
are the saponins such as, for example. Qui! A. (Superfos A/S, Denmark). Preferred adjuvants for 
use in the present invention include aluminum salts, such as A1K(S04)2, AlNa(S04)2, and 
A1NH4(S04). Examples of materials suitable for use in vaccine compositions are provided in 
Remington's Pharmaceutical Sciences (Osol, A, Ed, Mack Publishing Co, Easton, PA, pp. 1324- 
1341 (1980), which reference is incorporated herein by reference). 

The therapeutic compositions of the present invention can be administered parenterally by 
injection, rapid infusion, nasopharyngeal absorption (intranasopharangeally), dermoabsorption, 
or orally. The compositions may alternatively be administered intramuscularly, or intravenously. 
Compositions for parenteral administration include sterile aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Carriers or occlusive dressings can be used to increase skin permeability and enhance 
antigen absorption. Liquid dosage forms for oral administration may generally comprise a 
liposome solution containing the Uquid dosage form. Suitable forms for suspending liposomes 
include emulsions, suspensions, solutions, syrups, and ehxirs containing inert diluents 
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commonly used in the art, such as purified water. Besides the inert diluents, such compositions 
can also include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, 
flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
encapsulated form. For example, intranasal immunization of mice against Bordetella pertussis 
infection using vaccines encapsulated in biodegradable microsphere composed of poly(DL-lactide- 
co-glycolide) has been shown to stimulate protective immune responses. Shahin, R. et al. 
Infect Immun. 63: 1195-1200 (1995). Similarly, orally administered encapsulated Salmonella 
typhimurium antigens have also been shown to elicit protective inmiunity in mice. AUaoui- 
Attarki, K. et aL, Infect, Immun. 65:853-857 (1997). Encapsulated vaccines of the present 
invention can be administered by a variety of routes including those involving contacting the 
vaccine with mucous membranes (e.g., intranasally, intracolonicly, intraduodenally). 

Many different techniques exist for the timing of the immunizations when a multiple 
administration regimen is utilized. It is possible to use the compositions of the invention more 
than once to increase the levels and diversities of expression of the immunoglobulin repertoire 
expressed by the immunized animal. Typically, if multiple immunizations are given, they will be 
given one to two months apart. 

According to the present invention, an "effective amount" of a therapeutic composition is 
one which is sufficient to achieve a desired biological effect. Generally, the dosage needed to 
provide an effective amount of the composition will vary depending upon such factors as the 
animal's or human's age, condition, sex, and extent of disease, if any, and other variables which 
can be adjusted by one of ordinary skill in the art. 

The antigenic preparations of the invention can be administered by either single or multiple 
dosages of an effective amount. Effective amounts of the compositions of the invention can vary 
from 0.01-1,000 |ig/ml per dose, more preferably 0.1-500 )xg/ml per dose, and most preferably 
10-300 |Xg/ml per dose. 

Having now generally described the invention, the same will be more readily understood 
through reference to the following example which is provided by way of illustration, and is not 
intended to be limiting of the present invention, unless specified. 

Examples 

L Preparation of PCR Primers and Amplification of DNA 

Various fragments of the Borrelia burgdorferi genome, such as those of Table 1, can be 
used, in accordance with the present invention, to prepare PCR primers for a variety of uses. The 
PCR primers are preferably at least 15 bases, and more preferably at least 18 bases in length. 
When selecting a primer sequence, it is preferred that the primer pairs have approximately the 
same G/C ratio, so that melting temperatures are approximately the same. The PCR primers and 
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amplified DNA of this Example find use in the Examples that follow. 

2. Isolation of a Selected DNA Clone From B. burgdorferi 
Three approaches are used to isolate a B, burgdorferi clone comprising a polynucleotide of 
5 the present invention from any B. burgdorferi genomic DNA library. The B. burgdorferi strain 
B31PU has been deposited as a convienent source for obtaining a B. burgdorferi strain although a 
wide varity of strains B, burgdorferi strains can be used which are known in the art. 

B. burgdorferi genomic DNA is prepared using the following method. A 20ml overnight 
bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain Heart Infusion broth 
10 or Super broth), pelleted, ished two times with TES (30mM Tris-pH 8.0, 25mM EDTA, 50mM 
NaCl), and resuspended in 5ml high salt TES (2.5M NaCl). Lysostaphin is added to final 
concentration of approx 50ug/ml and the mixture is rotated slowly 1 hour at 37C to make 
protoplast cells. The solution is then placed in incubator (or place in a shaking water bath) and 
warmed to 55C. Five hundred micro liter of 20% sarcosyl in TES (final concentration 2%) is 
15 then added to lyse the cells. Next, guanidine HCl is added to a final concentration of 7M (3.69g 
in 5.5 ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear). A CsCl 
gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCl and overlaying with 
2.85M CsCl. The gradient is carefully overlayed with the DNA-containing GuHCl solution. The 
gradient is spun at 30,000 rpm, 20C for 24 hr and the lower DNA band is collected. The volume 
20 is increased to 5 ml with TE buffer. The DNA is then treated with protease K (10 ug/ml) 
overnight at 37 C, and precipitated with ethanol. The precipitated DNA is resuspended in a 
desired buffer. 

In the first method, a plasmid is directly isolated by screening a plasmid B. burgdorferi 
genomic DNA library using a polynucleotide probe corresponding to a polynucleotide of the 
25 present invention. Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. The 

oligonucleotide is labeled, for instance, with ^^P-y-ATP using T4 polynucleotide kinase and 

purified according to routine methods. {See, e.g., Maniatis et al.. Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) The library is 

30 transformed into a suitable host, as indicated above (such as XL-1 Blue (Stratagene)) using 
techniques known to those of skill in the art. See, e.g., Sambrook et al. MOLECULAR 
CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et 
al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 
1989). The transformants are plated on 1.5% agar plates (containing the appropriate selection 

35 agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. These plates 
are screened using Nylon membranes according to routine methods for bacterial colony screening. 
See, e.g., Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold 
Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN 
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MOLECULAR BIOLOGY (John Wiley and Sons, N. Y. 1989) or other techniques known to 
those of skill in the art. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends of a 
polynucleotide of Table 1 are synthesized and used to amplify the desired DNA by PGR using a 
5 B. burgdorferi genomic DNA prep as a template. PGR is carried out under routine conditions, for 
instance, in 25 p.1 of reaction mixture with 0.5 ug of the above DNA template. A convenient 
reaction mixture is 1.5-5 mM MgCl^, 0.01% (w/v) gelatin, 20 |ilM each of dATP, dGTP, dOTP, 
dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PGR 

(denaturation at 94°G for 1 min; annealing at 55''G for 1 min; elongation at 72°G for 1 min) are 

10 performed with a Perkin-Elmer Getus automated thermal cycler. The amplified product is 

analyzed by agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PGR product is verified to be the selected sequence by subcloning and 
sequencing the DNA product. 

Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically 

15 synthesized and used to generate a nucleotide sequence of desired length using PGR methods 
known in the art. 



3(a). Expression and Purification Borrelia polypeptides in E. coli 
The bacterial expression vector pQE60 is used for bacterial expression of some of the 

20 polypeptide fragements of the present invention. (QIAGEN, Inc., 9259 Eton Avenue, 

Ghatsworth, GA, 91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains 
a bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome binding site 
("RBS"), six codons encoding histidine residues that allow affinity purification using nickel- 
nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) and suitable single 

25 restriction enzyme cleavage sites. These elements are arranged such that an inserted DNA 

fragment encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a "6 X 
His tag") covalently linked to the carboxyl terminus of that polypeptide. 

The DNA sequence encoding the desired portion of a B. burgdorferi protein of the present 
invention is amplified from B. burgdorferi genomic DNA using PGR oligonucleotide primers 

30 which anneal to the 5' and 3' sequences coding for the portions of the B. burgdorferi 

polynucleotide shown in Table 1. Additional nucleotides containing restriction sites to facilitate 
cloning in the pQE60 vector are added to the 5' and 3' sequences, respectively. 

For cloning the mature protein, the 5' primer has a sequence containing an appropriate 
restriction site followed by nucleotides of the amino terminal coding sequence of the desired jB. 

35 burgdorferi polynucleotide sequence in Table 1. One of ordinary skill in the art would appreciate 
that the point in the protein coding sequence where the 5' and 3' primers begin may be varied to 
amplify a DNA segment encoding any desired portion of the complete protein shorter or longer 
than the mature form. The 3* primer has a sequence containing an appropriate restriction site 
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followed by nucleotides complementary to the 3' end of the polypeptide coding sequence of Table 
1, excluding a stop codon, with the coding sequence aligned with the restriction site so as to 
maintain its reading frame with that of the six His codons in the pQE60 vector. 

The amplified B, burgdorferi DNA fragment and the vector pQE60 are digested with 
5 restriction enzymes which recognize the sites in the primers and the digested DNAs are then 

ligated together. The B. burgdorferi DNA is inserted into the restricted pQE60 vector in a manner 
which places the B, burgdorferi protein coding region downstream from the IPTG-inducible 
promoter and in-frame with an initiating AUG and the six histidine codons. 

The ligation mixture is transformed into competent E. coli cells using standard procedures 

10 such as those described by Sambrook et al., supra.. E. coli strain M15/rep4, containing multiple 
copies of the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance 
("Kanr"), is used in carrying out the illustrative example described herein. This strain, which is 
only one of many that are suitable for expressing a 5. burgdorferi polypeptide, is available 
commercially (QIAGEN, Inc., supra). Transformants are identified by their ability to grow on 

15 LB agar plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from 

resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PGR and 
DNA sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture in 
LB media supplemented with both ampicillin (100 |ig/ml) and kanamycin (25 |ig/ml). The O/N 
20 culture is used to inoculate a large culture, at a dilution of approximately 1 :25 to 1:250. The cells 

are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-p-D- 

thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to induce 
transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequently are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

25 The cells are then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH 8. The cell 

debris is removed by centrifugation, and the supernatant containing the 5. burgdorferi 
polypeptide is loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column 
(QL\GEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high affinity 
are purified in a simple one-step procedure (for details see: The QIAexpressionist, 1995, 

30 QL\GEN, Inc., supra). Briefly the supematant is loaded onto the column in 6 M guanidine-HCl, 
pH 8, the column is fu-st washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl pH 6, and finally the B. burgdorferi polypeptide is eluted 
with 6 M guanidine-HCl, pH 5. 

The purified protein is then renatured by dialyzing it against phosphate-buffered saline 

35 (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the protein could be 
successfully refolded while immobilized on the Ni-NTA column. The recommended conditions 
are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 
mM Tris/HCl pH 7.4, containing protease inhibitors. The renaturation should be performed over 
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a period of 1.5 hours or more. After renaturation the proteins can be eluted by the addition of 250 
mM immidazole. Immidazole is removed by a final dialyzing step against PBS or 50 mM sodium 
acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 4° C or frozen at -80° C. 

The polypeptide of the present invention are also prepared using a non-denaturing protein 
purification method. For these polypeptides, the cell pellet from each liter of culture is 
resuspended in 25 mis of Lysis Buffer A at 4°C (Lysis Buffer A = 50 mM Na-phosphate, 300 
mM NaCl, 10 mM 2-mercaptoethanol, 10% Glycerol, pH 7.5 with 1 tablet of Complete EDTA- 
free protease inhibitor cocktail (Boehringer Mannheim #1873580) per 50 ml of buffer). 
Absorbance at 550 nm is approximately 10-20 O.D./ml. The suspension is then put through three 
freeze/thaw cycles from -IQPC (using a ethanol-dry ice bath) up to room temperature. The cells 
are lysed via sonication in short 10 sec bursts over 3 minutes at approximately SOW while kept on 
ice. The sonicated sample is then centrifuged at 15,000 RPM for 30 minutes at 4*'C. The 
supernatant is passed through a colunm containing 1.0 ml of CL-4B resin to pre-clear the sample 
of any proteins that may bind to agarose non-specifically, and the flow-through fraction is 
collected. 

The pre-cleared flow-through is applied to a nickel-nitrilo-tri-acetic acid ("Ni-NTA") 
affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag bind to the Ni-NTA 
resin with high affinity and can be purified in a simple one-step procedure. Briefly, the 
supernatant is loaded onto the column in Lysis Buffer A at 4^C, the column is first washed with 
10 volumes of Lysis Buffer A until the A280 of the eluate returns to the baseline. Then, the 
column is washed with 5 volumes of 40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) 
(Buffer B = 50 mM Na-Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 
500 mM Imidazole, pH of the final buffer should be 7.5). The protein is eluted off of the column 
with a series of increasing Imidazole solutions made by adjusting the ratios of Lysis Buffer A to 
Buffer B. Three different concentrations are used: 3 volumes of 75 mM Imidazole, 3 volumes of 
150 mM Imidazole, 5 volumes of 500 mM Imidazole. The fractions containing the purified 
protein are analyzed using 8 %, 10 % or 14% SDS-PAGE depending on the protein size. The 
purified protein is then dialyzed 2X against phosphate-buffered saline (PBS) in order to place it 
into an easily workable buffer. The purified protein is stored at 4°C or frozen at -80°. 

The following alternative method may be used to purify B, burgdorferi expressed in E coli 
when it is present in the form of inclusion bodies. Unless otherwise specified, all of the 

following steps are conducted at 4-lO^C. 

Upon completion of the production phase of the E, coli fermentation, the cell culture is 

cooled to 4-10'*C and the cells are harvested by continuous centrifiigation at 15,000 rpm (Heraeus 

Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the 
amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a 
homogeneous suspension using a high shear mixer. 
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The cells are then lysed by passing the solution through a microfluidizer (Microfuidics, 
Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCl 
solution to a final concentration of 0.5 M NaCl, followed by centrifugation at 7000 x g for 15 
min. The resultant pellet is washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 
5 7.4. 

The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine hydrochloride 
(GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the 

B. burgdorferi polypeptide-containing supernatant is incubated at 4°C overnight to allow further 

GuHCl extraction. 

10 Following high speed centrifugation (30,000 x g) to remove insoluble particles, the 

GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes of 
buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous stirring. 

The refolded diluted protein solution is kept at 4°C without mixing for 12 hours prior to further 

purification steps. 

15 To clarify the refolded B. burgdorferi polypeptide solution, a previously prepared 

tangential filtration unit equipped with 0.16 fxm membrane filter with appropriate surface area 

(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample 
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The colunm 
is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 

20 1500 niM NaCl in the same buffer, in a stepwise manner. The absorbance at 280 mm of the 

effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the B. burgdorferi polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, 

25 Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium 

acetate, pH 6.0. Both colunms are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. 
The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M 
NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions 
are collected under constant Ajgo monitoring of the effluent. Fractions containing the B, 

30 burgdorferi polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant B. burgdorferi polypeptide exhibits greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands are observed from Conmiassie blue 

stained 16% SDS-PAGE gel when 5 ^ig of purified protein is loaded. The purified protein is also 

tested for endotoxin/LPS contamination, and typically the LPS content is less than 0.1 ng/ml 
35 according to LAL assays. 



3(b), Alternative Expression and Purification Borrelia polypeptides in E, 
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coli 

Tthe vector pQElO is alternatively used to clone and express some of the polypeptides of 
the present invention for use in the soft tissue and systemic infection models discussed below. 
The difference being such that an inserted DNA fragment encoding a polypeptide expresses that 
polypeptide with the six His residues (i.e., a "6 X His tag") covalently linked to the amino 
terminus of that polypeptide. The bacterial expression vector pQElO (QIAGEN, Inc., 9259 Eton 
Avenue, Chatsworth, CA, 9131 1) was used in this example . The components of the pQElO 
plasmid are arranged such that the inserted DNA sequence encoding a polypeptide of the present 
invention expresses the polypeptide with the six His residues (/.e., a "6 X His tag")) covalently 
linked to the amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of Table 1 were 
amplified using PCR oligonucleotide primers from genomic B, burgdorferi DNA. The PGR 
primers anneal to the nucleotide sequences encoding the desired amino acid sequence of a 
polypeptide of the present invention. Additional nucleotides containing restriction sites to 
facDitate cloning in the pQElO vector were added to the 5' and 3' primer sequences, respectively. 

For cloning a polypeptide of the present invention, the 5' and 3' primers were selected to 
amplify their respective nucleotide coding sequences. One of ordinary skill in the art would 
appreciate that the point in the protein coding sequence where the 5' and 3' primers begins may be 
varied to amplify a DNA segment encoding any desired portion of a polypeptide of the present 
invention. The 5' primer was designed so the coding sequence of the 6 X His tag is aligned with 
the restriction site so as to maintain its reading frame with that of B, burgdorferi polypeptide. The 
3* was designed to include an stop codon. The amplified DNA fragment was then cloned, and the 
protein expressed, as described above for the pQE60 plasmid. 

The DNA sequences of Table 1 encoding amino acid sequences may also be cloned and 
expressed as fusion proteins by a protocol similar to that described directly above, wherein the 
pET-32b(+) vector (Novagen, 601 Science Drive, Madison, WI 5371 1) is preferentially used in 
place of pQElO. 

The above methods are not limited to the polypeptide fragements actually produced. The 
above method, like the methods below, can be used to produce either fiill length polypeptides or 
desired fragements therof . 

3(c). Alternative Expression and Purification of Borrelia polypeptides in 
E. coli 

The bacterial expression vector pQE60 is used for bacterial expression in this example 
(QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). However, in this example, the 
polypeptide coding sequence is inserted such that translation of the six His codons is prevented 
and, therefore, the polypeptide is produced with no 6 X His tag. 

The DNA sequence encoding the desired portion of the B, burgdorferi amino acid 
sequence is amplified from an B. burgdorferi genomic DNA prep the deposited DNA clones 
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using PGR oligonucleotide primers which anneal to the 5' and 3' nucleotide sequences 
conesponding to the desired portion of the B. burgdorferi polypeptides. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' and 3' 
primer sequences. 

For cloning a B, burgdorferi polypeptides of the present invention, 5* and 3' primers are 
selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art 
would appreciate that the point in the protein coding sequence where the 5' and 3' primers begin 
may be varied to amplify a DNA segment encoding any desired portion of a polypeptide of the 
present invention. The 3' and 5* primers contain appropriate restriction sites followed by 
nucleotides complementary to the 5* and 3' ends of the coding sequence respectively. The 3' 
primer is additionally designed to include an in-frame stop codon. 

The amplified B, burgdorferi DNA fragments and the vector pQE60 are digested with 
restriction enzymes recognizing the sites in the primers and the digested DNAs are then ligated 
together. Insertion of the B. burgdorferi DNA into the restricted pQE60 vector places the B, 
burgdorferi protein coding region including its associated stop codon downstream from the IPTG- 
inducible promoter and in-frame with an initiating AUG. The associated stop codon prevents 
translation of the six histidine codons downstream of the insertion point. 

The ligation mixture is transformed into competent E. coli cells using standard procedures 
such as those described by Sambrook et al. E. coli strain M15/rep4, containing multiple copies of 
the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance 
("Kanr"), is used in carrying out the illustrative example described herein. This strain, which is 
only one of many that are suitable for expressing B. burgdorferi polypeptide, is available 
commercially (QIAGEN, Inc., supra), Transformants are identified by their ability to grow on 
LB plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from resistant 
colonies and the identity of the cloned DNA confirmed by restriction analysis, PGR and DNA 
sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture in 
LB media supplemented with both ampicilhn (100 jiig/ml) and kanamycin (25 |lg/ml). The O/N 
culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells 
are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. isopropyl-b-D- 
thiogalactopyranoside ("BPTG") is then added to a final concentration of 1 mM to induce 
transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequently are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the B, burgdorferi polypeptide, the cells are then stirred for 3-4 hours at 4^C in 

6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the supernatant 
containing the B, burgdorferi polypeptide is dialyzed against 50 mM Na-acetate buffer pH 6, 
supplemented with 200 mM NaCl. Alternatively, the protein can be successfully refolded by 
dialyzing it against 500 mM NaCl, 20% glycerol, 25 mM Tris/HCl pH 7.4, containing protease 
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inhibitors. After renaturation the protein can be purified by ion exchange, hydrophobic interaction 
and size exclusion chromatography. Alternatively, an affinity chromatography step such as an 
antibody column can be used to obtain pure B. burgdorferi polypeptide. The purified protein is 
stored at 4° C or ft-ozen at -80° C. 
5 The following alternative method may be used to purify B. burgdorferi polypeptides 

expressed in E colt when it is present in the form of inclusion bodies. Unless otherwise 

specified, all of the following steps are conducted at 4-lO^C. 

Upon completion of the production phase of the E. coli fermentation, the cell culture is 

cooled to 4-10°C and the cells are harvested by continuous centrifiigation at 15,000 rpm (Heraeus 

10 Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the 

amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4, The cells are dispersed to a 
homogeneous suspension using a high shear mixer. 

The cells ware then lysed by passing the solution through a microfluidizer (Microfiiidics, 

15 Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCl 
solution to a final concentration of 0.5 M NaCl, followed by centrifiigation at 7000 x ^ for 15 
min. The resultant pellet is washed again using 0.5M NaCI, 100 mM Tris, 50 mM EDTA, pH 
7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride 
20 (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the 

B, burgdorferi polypeptide-containing supernatant is incubated at 4®C overnight to allow further 

GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, the 
GuHCl solubiUzed protein is refolded by quickly mixing the GuHCl extract with 20 volumes of 
25 buffer containing 50 nvM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous stirring. 

The refolded diluted protein solution is kept at 4*^C without mixing for 12 hours prior to further 

purification steps. 

To clarify the refolded B. burgdorferi polypeptide solution, a previously prepared 
tangential filtration unit equipped with 0.16 \\m membrane filter with appropriate surface area 

30 (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample 
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The column 
is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 5(X) mM, 1(X)0 mM, and 
1500 mM NaCl in the same buffer, in a stepwise manner. The absorbance at 280 mm of the 
effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

35 Fractions containing the B, burgdorferi polypeptide are then pooled and mixed with 4 

volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, 
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Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium 
acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. 
The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M 
NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions 
5 are collected under constant A^^ monitoring of the effluent. Fractions containing the J5. 
burgdorferi polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant B, burgdorferi polypeptide exhibits greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands are observed from Conmiassie blue 

stained 16% SDS-PAGE gel when 5 fxg of purified protein is loaded. The purified protein is also 

10 tested for endotoxin/LPS contamination, and typically the LPS content is less than 0. 1 ng/ml 
according to LAL assays. 

3(d). Cloning and Expression of B. burgdorferi in Other Bacteria 

B. burgdorferi polypeptides can also be produced in: JS. burgdorferi using the methods of 
15 S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. 1. Moreno (1996) Protein Expr. Purif. 
8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 AppL Microbiol. 
Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods Chang et al., U.S. Patent 
No. 4,952,508. 

20 4, Cloning and Expression in COS Cells 

A jS. burgdorferi expression plasmid is made by cloning a portion of the DNA encoding a 
B. burgdorferi polypeptide into the expression vector pDNAI/Amp or pDNAIII (which can be 
obtained from Invitrogen, Inc.). The expression vector pDNAI/amp contains: (1) an coli 
origin of replication effective for propagation in JE, coli and other prokaryotic cells; (2) an 

25 ampicillin resistance gene for selection of plasmid-containing prokaryotic cells; (3) an SV40 origin 
of replication for propagation in eukaryotic cells; (4) a CMV promoter, a polylinker, an S V40 
intron; (5) several codons encoding a hemagglutinin fragment (i.e., an "HA" tag to facilitate 
purification) followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and operably linked to 

30 the S V40 intron and the polyadenylation signal by means of restriction sites in the polylinker. 

The HA tag corresponds to an epitope derived from the influenza hemagglutinin protein described 
by Wilson et al. 1984 Cell 37:767. The fusion of the HA tag to the target protein allows easy 
detection and recovery of the recombinant protein with an antibody that recognizes the HA 
epitope. pDNAin contains, in addition, the selectable neomycin marker. 

35 A DNA fragment encoding a B. burgdorferi polypeptide is cloned into the polylinker 

region of the vector so that recombinant protein expression is directed by the CMV promoter. The 
plasmid construction strategy is as follows. The DNA from a B, burgdorferi genomic DNA prep 
is amplified using primers that contain convenient restriction sites, much as described above for 
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construction of vectors for expression of B, burgdorferi in E, coli. The 5' primer contains a 
Kozak sequence, an AUG start codon, and nucleotides of the 5* coding region of the B. 
burgdorferi polypeptide. The 3' primer, contains nucleotides complementary to the 3' coding 
sequence of the B, burgdorferi DNA, a stop codon, and a convenient restriction site. 
5 The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested with 

appropriate restriction enzymes and then ligated. The ligation mixture is transformed into an 

appropriate E. coli strain such as SURE^"^ (Stratagene Cloning Systems, La JoUa, CA 92037), 

and the transformed culture is plated on ampicilhn media plates which then are incubated to allow 
growth of ampicillin resistant colonies. Plasmid DNA is isolated from resistant colonies and 

10 examined by restriction analysis or other means for the presence of the fragment encoding the B. 
burgdorferi polypeptide 

For expression of a recombinant B. burgdorferi polypeptide, COS cells are transfected 
with an expression vector, as described above, using DEAE-dextran, as described, for instance, 
by Sambrook et al. {supra). Cells are incubated under conditions for expression of B. 

15 burgdorferi by the vector. 

Expression of the fl. burgdorferi-HA fusion protein is detected by radiolabeling and 
immunoprecipitation, using methods described in, for example Harlow et al., supra.. To this 
end, two days after transfection, the cells are labeled by incubation in media containing ^^S- 
cysteine for 8 hours. The cells and the media are collected, and the cells are washed and the lysed 

20 with detergent-containing RIPA buffer: 150 mM NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% 
DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. {supra ). Proteins are precipitated 
from the cell lysate and from the culture media using an HA-specific monoclonal antibody. The 
precipitated proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
product of the expected size is seen in the cell lysate, which is not seen in negative controls. 

25 

5. Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of 5. burgdorferi polypeptide in this example. 
Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession No. 37146). The 
plasmid contains the mouse DHFR gene under control of the S V40 early promoter. Chinese 

30 hamster ovary cells or other cells lacking dihydrafolate activity that are transfected with these 
plasmids can be selected by growing the cells in a selective medium (alpha minus MEM, Life 
Technologies) supplemented with the chemotherapeutic agent methotrexate. The amplification of 
the DHFR genes in cells resistant to methotrexate (MTX) has been well documented. See, e.g., 
Alt et al., 1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys, 

35 Acta, 1097: 107-143; Page et al., 1991 , Biotechnology 9:64-68. Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the target enzyme, 
DHFR, as a result of amplification of the DHFR gene. If a second gene is linked to the DHFR 
gene, it is usually co-amplified and over-expressed. It is known in the art that this approach may 
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be used to develop cell lines carrying more than 1,000 copies of the amplified gene(s). 
Subsequently, when the methotrexate is withdrawn, cell lines are obtained which contain the 
amplified gene integrated into one or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) of the Rouse 
5 Sarcoma Virus, for expressing a polypeptide of interest, CuUen, et al. (1985) Mol. Cell. Biol. 
5:438-447; plus a fragment isolated from the enhancer of the immediate early gene of human 
cytomegalovirus (CMV), Boshart, et al., 1985, Cell 41:521-530. Downstream of the promoter 
are the following single restriction enzyme cleavage sites that allow the integration of the genes: 
Bam HI, Xba I, and A^p 718. Behind these cloning sites the plasmid contains the 3' intron and 

10 polyadenylation site of the rat preproinsulin gene. Other high efficiency promoters can also be 

used for the expression, e.g., the human 6-actin promoter, the SV40 eeirly or late promoters or the 
long terminal repeats from other retrovimses, e.g., HIV and HTLVI. Clontech's Tet-Off and Tet- 
On gene expression systems and similar systems can be used to express the B. burgdorferi 
polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, Proc. Natl. Acad. Sci. 

15 USA 89:5547-555 1. For the polyadenylation of the mRNA other signals, e.g., from the human 
growth hormone or globin genes can be used as well. Stable cell lines carrying a gene of interest 
integrated into the chromosomes can also be selected upon co-transfection with a selectable 
marker such as gpt, G418 or hygromycin. It is advantageous to use more than one selectable 
marker in the beginning, e.g., G418 plus methotrexate. 

20 The plasmid pC4 is digested with the restriction enzymes and then dephosphorylated 

using calf intestinal phosphates by procedures known in the art. The vector is then isolated from 
a 1% agarose gel. The DNA sequence encoding the B. burgdorferi polypeptide is amplified using 
PCR oligonucleotide primers corresponding to the 5' and 3' sequences of the desired portion of 
the gene. A 5' primer containing a restriction site, a Kozak sequence, an AUG start codon, and 

25 nucleotides of the 5' coding region of the B. burgdorferi polypeptide is synthesized and used. A 
y primer, contciining a restriction site, stop codon, and nucleotides complementary to the 3' 
coding sequence of the B. burgdorferi polypeptides is synthesized and used. The amplified 
fragment is digested with the restriction endonucleases and then purified again on a 1 % agarose 
gel. The isolated fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. 

30 E. coli HBlOl or XL-1 Blue cells are then transformed and bacteria are identified that contain the 
fragment inserted into plasmid pC4 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for transfection. Five 
^ig of the expression plasmid pC4 is cotransfected with 0.5 jxg of the plasmid pSVneo using a 

hpid-mediated transfection agent such as Lipofectin^*^ or LipofectAMINE.^" (LifeTechnologies 

35 Gaithersburg, MD), The plasmid pSV2-neo contains a dominant selectable marker, the neo gene 
from Tn5 encoding an enzyme that confers resistance to a group of antibiotics including G418. 
The cells are seeded in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the 
cells are trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 
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MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about 
10-14 days single clones are trypsinized and then seeded in 6- well petri dishes or 10 ml flasks 
using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). 
Clones growing at the highest concentrations of methotrexate are then transferred to new 6-well 
5 plates containing even higher concentrations of methotrexate (1 |liM, 2 |iM, 5 pM, 10 mM, 20 
mM). The same procedure is repeated until clones are obtained which grow at a concentration of 
100-200 ^M. Expression of the desired gene product is analyzed, for instance, by SDS-PAGE 
and Western blot or by reversed phase HPLC analysis. 

6. Immunization and Detection of Immune Responses 

10 

6(a). B, burgdorferi propagation 

B, burgdorferi sensu stricto isolate B31 is propagated in tightly-closed containers at 34°C 
in modified Barbour-Stoenner-Kelly (BSKII) medium (Barbour, A.G., Yale J, Biol Med. 
57:521-525 (1984)) overlaid with a 5%OJ5%CO^90%\>i^ gas mixture. Cell densities of these 

15 cultures are determined by darkfield microscopy at 400X. 

Immunization of Mice and Challenge with B. burgdorferi. For active immunizations 
BALB/cByJ mice (BALB, Jackson Laboratories) are injected intraperitoneally (i.p.) at week 0 
with 20 g of recombinant borrelial protein, or phosphate-buffered saline (PBS), emulsified with 
complete Freund's adjuveint (CFA), given a similar booster immunization in incomplete Freund's 

20 adjuvant (IF A) at week 4, and challenged at week 6. For challenge B, burgdorferi are diluted in 
BSKII from exponentially-growing cultures and mice are injected subcutaneously (s.c.) at the 
base of the tail with 0.1 ml of these dilutions (typically 10^-10"* borreliae; approximately 10-100 
times the median infectious dose). Borreliae used for challenge are passaged fewer than six times 
in vitro. To assess infection, mice are sacrificed at 14-17 days post-challenge, and specimens 

25 derived from ear, bladder, and tibiotarsal joints are placed in BSKH plus 1.4% gelatin, 13 g/ml 
amphotericin B, 1.5 g/ml phosphomycin, and 15 g/ml rifampicin, and borrelia outgrowth at two 
or three weeks is quantified by darkfield microscopy. Batches of BSKII are qualified for 
infection testing by confirming that they supported the growth of 1-5 cells of isolate B31. In 
some instances seroconversion for protein P39 reactivity is also used to confirm infections (see 

30 below). Others have previously shown that mice elicited antibodies to P39 when inoculated with 
live borreliae by syringe or tick bite, but not with killed borreliae (Simpson, W.J., et ai, J. Clin. 
Microbiol. 29:236-243 (1991)). 

6(b), Immunoassays 

35 Several inraiunoassay formats are used to quantify levels of borreha-specific antibodies 

(ELISA and inamunoblot), and to evaluate the functional properties of these antibodies (growth 
inhibition assay). The ELISA and immunoblot assays are also used to detect and quantify 
antibodies elicited in response to borrelial infection that react with specific borrelial antigens. 
Where antibodies to certain borrelial antigens are elicited by infection this is taken as evidence that 
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the borrelial proteins in question are expressed in vivo. Absence of infection-derived antibodies 
(seroconversion) following borrelial challenge is evidence that infection is prevented or 
suppressed. The inununoblot assay is also used to ascertain whether antibodies raised against 
recombinant borrelial antigens recognize a protein of similar size in extracts of whole borreUae. 
5 Where the natural protein is of similar, or identical, size in the immunoblot assay to the 

recombinant version of the same protein, this is taken as evidence that the recombinant protein is 
the product of a full-length clone of the respective gene. 

Enzyme-Linked Immunosorbant Assay (EUSA). The ELISA is used to quantify levels of 
antibodies reactive with borrelial antigens elicited in response to immunization with these borrelial 

10 antigens. Wells of 96 well microti ter plates (Immunlon 4, Dynatech, Chantilly, Virginia, or 
equivalent) are coated with antigen by incubating 50 1 of 1 g/ml protein antigen solution in a 
suitable buffer, typically 0. 1 M sodium carbonate buffer at pH 9.6. After decanting unbound 
antigen, additional binding sites are blocked by incubating 100 1 of 3% nonfat milk in wash 
buffer (PBS, 0.2% Tween 20, pH 7.4). After washing, duplicate serial two-fold dilutions of sera 

15 in PBS, Tween 20, 1% fetal bovine serum, are incubated for 1 hr, removed, wells are washed 
three times, and incubated with horseradish peroxidase-conjugated goat anti-mouse IgG. After 
three washes, bound antibodies are detected with H2O2 and 2,2'-azino-di-(3-ethylbenzthiazoline 
sulfonate) (Schwan, T.G., et ai, Proc, Natl Acad. Sci. USA 92:2909-2913 (1985)) (ABTS®, 
Kirkegaard & Perry Labs., Gaithersburg, MD) and A405 is quantified with a Molecular Devices, 

20 Corp. (Menlo Park, California) Vmax™ plate reader. IgG levels twice the background level in 
serum from naive mice are assigned the minimum titer of 1 : 100. 

6(c). In Vitro Growth Inhibition Assay 

Unlike other bacteria, borreliae can be killed by the binding of specific antibodies to their 
25 surface antigens. The mechanism for this in vitro killing or growth-inhibitory effect is not 

known, but can occur in the absence of serum complement, or other immune effector functions. 
Antibodies elicited in animals receiving immunizations with specific borrelial antigens that result 
in protection from borrelial challenge usually will direcdy kill borreliae in vitro. Thus, the in vitro 
growth inhibition assay also h£is a high predictive value for the protective potency of the borrelial 
30 antibodies, although exceptions, such as antibodies against OspC which are weak at in vitro 
growth inhibition, have been observed. Also, this assay can be used to evaluate the serologic 
conservation of epitope binding protective antibodies. A microwell antibody titration assay 
(Sadziene, A., et aL, 7. Infect. Dis. 167: 165-172 (1993)) is used to evaluate the growth inhibition 
(GI) properties of antisera against recombinant borrelial antigens against the homologous B3 1 
35 isolate, and against various strains of borrelia. Briefly, 10^ borrelia in 100 1 BSKII are added to 
serial two-fold dilutions of sera in 100 1 BSKII in 96- well plates, and the plates are covered and 
incubated at 34*'C in a 5%O2/5%CO2/90%N2 gas mixture for 72 h prior to quantification of 
borreUa growth by darkfield microscopy. 
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6(d). Sodiumdodecylsulfate-Polyacrylamide Gel Electrophoresis 
(SDS'PAGE) and Immunoblotting 

Using a single well format, total borrelial protein extracts, recombinant borrelial antigen, 
or recombinant P39 samples (2 g of purified protein, or more for total borrelial extracts) are 
5 boiled in SDS/2-ME sample buffer before electrophoresis through 3% acrylamide stacking gels, 
and resolving gels of higher acrylamide concentration, typically 10-15% acrylamide monomer. 
Gfels are electro-blotted to nitrocellulose membranes and lanes are probed with dilutions of 
antibody to be tested for reactivity with specific borrelial antigens, followed by the appropriate 
secondary antibody-enzyme (horseradish peroxidase) conjugate. When it is desirable to confirm 
10 that the protein had transferred following electro-blotting, membranes are stained with Ponceau S. 
Immunoblot signals from bound antibodies are detected on x-ray film as chemiluminescence using 
ECL™ reagents (Amersham Corp., Arlington Heights, Illinois). 

6(e). Detection of Borrelia mRNA expression 
15 Northern blot analysis is carried out using methods described by, among others, 

Sambrook et aL, supra, to detect the expression of the B, burgdorferi nucleotide sequences of the 

present invention in animal tissues. A cDNA probe containing an entire nucleotide sequence 

shown in Table 1 is labeled with ^^P using the redipnme^^ DNA labeling system (Amersham Life 

Science), according to manufacturer's instructions. After labeling, the probe is purified using a 
20 CHROMA SPrN-lOO""^ column (Clontech Laboratories, Inc.), according to manufacturer's 

protocol number PT 1200-1. The purified labeled probe is then used to detect the expression of 

Borrelia mRNA in an animal tissue sample. 

Animal tissues, such as blood or spinal fluid, are examined with the labeled probe using 

ExpressHyb'"'^ hybridization solution (Clontech) according to manufacturer's protocol number 
25 PTl 190-L Following hybridization and washing, the blots are mounted and exposed to film at - 

70 C overnight, and films developed according to standard procedures. 

The disclosure of all pubUcations (including patents, patent applications, journal articles, 

laboratory manuals, books, or other documents) cited herein are hereby incorporated by reference 

in their entireties. 

30 The present invention is not to be hmited in scope by the specific embodiments described 

herein, which eire intended as single illustrations of individual aspects of the invention. 
Functionally equivalent methods and components are within the scope of the invention, in 
addition to those shown and described herein and will become apparant to those skilled in the art 
from the foregoing description and accompanying drawings. Such modifications are intended to 

35 fall within the scope of the appended claims. 

Provisional Application Serial No. 60/057,483 filed 3 September 1997 is incorporated by 
reference herein in its entirety. 
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TABLE 1. Nucleotide and Amino Acid Sequences 
flOl.aa 

MSKIFLLFNAGFFFLKIIYVFSYPEIKNFSRQDPVFSDLKIKVLKYNKKQHIPLFFYSYKVKKGDTFFKIANKING 

WQSGIATINLLDSPAVSVGQEILIPSKKGVFWDSKDYRFNNLLLATRDLJUCAEKVKIKRNDRV^ 

DFGLFSGTELLFFLJSrANFIFPLKKFIVSSDFGFRJSfDPFTGNKSFHTGIDLAAPMNAEVYLLLLE 

tlOl .aa 

SYPEIKNFSRQDPWSDLKIKVLKYNKKQHIPLFFYSYKVKKGDTFFKIANKINGWQSGIATINLLDSPAVSVGQE 

ILIPSKKGVFVFDSKDYRFNNLLI^TRDLAKAEKVKIKRNDRVYEFYFFDFVKNPDFGLFSGTELLFFLNAN^ 

LKKFIVSSDFGFRNDPFTGNKSFHTGIDLAAPMNAEVYLLLLE 



flOl.nt 

ATGAGTAAAATTTTTTTATTATTTAATGCAGGTTTCTTTTTTTTAAAAATAATTTATGTTTTTTCTTATCCAGAAA 
TAAAAAATTTCTCAAGGCAAGATCCTGTTTTTTCTGATCTTAAAATTAAAGTTTTAAAATATAACAAAAAACAACA 
TATTCCTCTGTTTTTTTACTCATATAAAGTTAAAAAAGGGGATACTTTTTTTAAAATTGCCAATAAAATAAATGGA 
TGGCAGTCCGGCATTGCTACTATTAATTTATTAGATTCTCCTGCTGTGAGTGTTGGGCAAGAGATTCTTATTCCCA 
GTAAAAAAGGAGTTTTTGTTTTTGATAGTAAAGATTATAGATTTAATAATTTGCTTTTAGCAACAAGGGATCTTGC 
TAAAGCTGAAAAGGTAAAAATTAAAAGGAACGACAGAGTTTATGAATTTTATTTTTTTGATTTTGTTAAGAATCCA 
GATTTTGGACTTTTTTCAGGCACAGAATTGCTTTTTTTCTTAAATGCCAATTTTATTTTTCCTTTAAAAAAATTTA 
TTGTTAGTTCTGATTTTGGATTTAGAAATGACCCTTTCACTGGCAACAAAAGTTTCCATACAGGAATAGATCTTGC 
AGCTCCAATGAATGCTGAAGTGTATCTTCTTCTTCTGGAATAG 



tlOl.nt 

TCTTATCCAGAAATAAAAAATTTCTCAAGGCAAGATCCTGTTTTTTCTGATCTTAAAATTAAAGTTTTAAAATATA 
ACAAAAAACAACATATTCCTCTGTTTTTTTACTCATATAAAGTTAAAAAAGGGGATACTTTTTTTAAAATTGCCAA 
TAAAATAAATGGATGGCAGTCCGGCATTGCTACTATTAATTTATTAGATTCTCCTGCTGTGAGTGTTGGGCAAGAG 
ATTCTTATTCCCAGTAAAAAAGGAGTTTTTGTTTTTGATAGTAAAGATTATAGATTTAATAATTTGCTTTTAGCAA 
CAAGGGATCTTGCTAAAGCTGAAAAGGTAAAAATTAAAAGGAACGACAGAGTTTATGAATTTTATTTTTTTGATTT 
TGTTAAGAATCCAGATTTTGGACTTTTTTCAGGCACAGAATTGCTTTTTTTCTTAAATGCCAATTTTATTTTTCCT 
TTAAAAAAATTTATTGTTAGTTCTGATTTTGGATTTAGAAATGACCCTTTCACTGGCAACAAAAGTTTCCATACAG 
GAATAGATCTTGCAGCTCCAATGAATGCTGAAGTGTATCTTCTTCTTCTGGAATAG 

f 11 .aa 

VKKYIKTIFLISMVYFYCCTTIKINHDYETDFKVLESPSKYINIDVIKATNEYIYIQITNNSLDWKINWQNTS 

NDKIVLKKEDLTI^^^ETGYKNKYREFFIGPKTSFKFKVYPLKIHSKNKNSNNLSSTIKYPSIFKIiNIT 

TINVLITRTTKINITNK 

til .aa 

CCTTIKINHDYETDFKVLESPSKYINIDVIKATNEYIYIQITNNSLDWKINWQNTSUSnSfDKIVL^^ 
GYKNKYREFFIGPKTSFKFKVYPLKIHSKNKNSNNLSSTIKYPSIFKLNITKVGIEAKKTINVLITRTTKINITNK 

fll .nt 

GTGGAAAAATTTCTTTTATTCCAGGAAATGAAAATATTGCAGATCTTGGTTTTCATAAAACTAAGTAGAAATAT^ 

TCAAAAAATACATAAAAACAATATTTCTGATTTCAATGGTTTATTTTTATTGTTGTACGACAATA^ 

TGATTATGAAACTGATTTTAAAGTTCTAGAATCTCCCTCTAAATACATCAATATAGATGTAAOT 

GAATATATTTATATTCAAATTACAAACAATAGCTTAGACGTAGTAAAAATAAATTGGCAAAACACTAGTCT^ 

ACGATAAGATCGTCTTAAAAAAAGAAGATCTTACAATAAACAATGAAACAGGGTATAAAAATAAATACAGAGAGT^ 
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TABLE L Nucleotide and Amino Acid Sequences 

TTTTATTGGTCCTAAAACTTCATTTAAATTTAAAGTATATCCACTAAAAATTCATTCTAAAAACAAAAATAGCAAT 
AACTTAAGCTCAACTATTAAATATCCGTCTATTTTTAAGCTCAACATAACAAAAGTAGGAATTGAAGCAAAAAAAA 
CAATAAATGTTTTAATAACAAGAACTACAAAAATTAATATTACTAATAAATGA 

til .nt 

TGTTGTACGACAATAAAAATAAACCATGATTATGAAACTGATTTTAAAGTTCTAGAATCTCCCTCTAAATA^ 

ATATAGATGTAATTAAAGCTACAAATGAATATATTTATATTCAAATTACAAACAATAGCTTAGACGTAGTAAAAAT 

AAATTGGCAAAACACTAGTCTTAACAACGATAAGATCGTCTTAAAAAAAGAAGATCTTACAATAAACAATGAAACA 

GGGTATAAAAATAAATACAGAGAGTTTTTTATTGGTCCTAAAACTTCATTTAAATTTAAAGTATATCCACTAAAAA 

TTCATTCTAAAAACAAAAATAGCAATAACTTAAGCTCAACTATTAAATATCCGTCTATTTTTAAGCTCAACATAAC 

AAAAGTAGGAATTGAAGCAAAAAAAACAATAAATGTTTTAATAACAAGAACTACAAAAATTAATATTACTAAT/^ 

TGA 

fl2 .aa 

MREFLYRNVFKKSFIVFLIFLTFSNAIFAQTIDDENSKKRDKLTLSQKSYLRELELSTDEDLKKWALKEGLKETDV 

SKIRELLLKKFGIDPELFIKGKGLAGSGRYKIIIETADNLENFTYGLTKDESIIFEGRVNILVEDIKENKKHNIKG 

DRIVIJSTKNSKKLYAIGNVEYILDMDTNEKLYFYGNEFLVDFDSQNFLLKNGILQKKMQKNQIDHILSFGGKVLKKI 

DNDWILEQAFATTSKIPEPYYSIKASKIWALPSGDFGFLNAIFYMGRVPVFYIPFFFRPGDSLFFNPSLGIJSrPRK 

GFSWNTVYLFGNKSSSEDSSFLDFDFNSVYNSGKKPYIRNGYLTYFFAENLAPSWKDYVKLIFDIYANI/SFYSG 

IDFNLGNTLGHFKTLEGNFGLGFTRNVYSYDGGYYPFDNRTLKQSLFSFSNLNKGDVFGFEVPFRYLFKFKTEFLL 

SDALFSWLEHYSDPYVNIDFRDRIESATFFSLLNLDKDSWEQTSISTFDVmLSSFYKRTFNIXSSILDYKLNN 

LSFKLSGYENLYVKSPLEKPKDVNDPTRKWFYLERIYAPYIDLNFQKDLYlSnNrQWTFPADTKEMIM^ 

NDKKSVKEKNTKKTTELTKDLYIPPEPITLKNIDQSDSFFIRFGINPYLRNNVFFDl^GITSPKDFNYEIKNYLF 

IKNKTDIKIHADFYNRLITFENLLYLNTIEYSPLNKDFKVEDKDKKSEHSIINQINLNLLPFIRYPLFSRSTLKFE 

NKATLYSFNKKYDSDVKSLVNKNSSIFLSDPETFYQSLTASLIYDYDYFTTELSGELJCNSFEDIKASSELKLSLDF 

PYLLQEAGIGIKYYKKFKEDAMKNSGISAVQSPLEPQKPSSPYKNLEMSPALYYKIEPRYLDYFKFSFLVAYDPLI 

imVSELSFKLNVF-DFQFLFAMKDDFEYNYDPLKGDFSKIGTTTKLVPYSLDSSYKKELYVLTFFDJ^ 

GWKINLQKFTDNELRSALTLKFKYTEFLEIYFSTLSINTKTFKYFKGYMDQIGLEPVNFFVDLSKSFNFFNSQDRK 

DSLFKIKKFSSGFKFNFYDWKFVGEYNLEPDLLRGSDGIYSPIWRNNFTIYISWNFFAPIKASFENNKIDTN^ 

NRKTKK 

tl2 .aa 

IFAQTIDDENSKKRDKLTLSQKSYLRELELSTDEDLKKWALKEGLKETDVSKIRELLLKKFGIDPELFIKGKGLAG 

SGRYKI 1 1 ETADNLENFTYGLTKDES 1 1 FEGRVNI LVEDIKENKKHNI KGDRI VLNKNSKKLYAIGNVEYI LDMDT 

NEIKLYFYGNEFLVDFDSQNFLLKNGILQKKMQKNQIDHILSFGGKVLKKIDNDVTILEQAFATTSKIPEPYYSIK 

ASKIWALPSGDFGFLNAIFYMGRVPVFYIPFFFRPGDSLFFNPSLGLNPRKGFSVFNTVYLFGNKSSSEDSSFLDF 

DFNSVYNSGKKPYIRNGYLTYFFAENLAPSWKDYVKLIFDIYANU3FYSGIDFNLGNTLGHFKTLEGNFGLGFTR 

NVYSYDGGYYPFDNRTLKQSLFSFSNLNKGDVFGFEVPFRYLFKFKTEFLLSDALFSWLEHYSDPYVNIDF^ 

ESATFFSLLNLDKDSVKEQTSISTFDWNLSSFYKRTFNIXSSILDYKIJ^mLGLSFKLSGYENLYVKSPLE^ 

PTRKWFYLERIYAPYIDLNFQKDLYNNQWTFPADTKEMIMRPEIKNLEDKDNDKKSVKEKNTKJKTTELTKI)L^ 

EPITLKNIDQSDSFFIRFGINPYLRNNVFFDNYGITSPKDFNYEIKIJYLFDIKNK^ 

LNTIEYSPLNKDFKVEDKDKKSEHSIINQINI^LLPFIRYPLFSRSTLKFENKATLYSFNKKYDSDVKSLVNK^ 

IFLSDPETFYQSLTASLIYDYDYFTTELSGELKNSFEDIKASSELKLSLDFPYLLQEAGIGIKYYKKFKEDAMKNS 

GISAVQSPLEPQKPSSPYKNLEMSPALYYKIEPRYLDYFKFSFLVAYDPLINRVSELSFKLNVFDFQFLFAMKDDF 

EYNYDPLKGDFSKIGTTTKLVPYSLDSSYKKELYVLTFFDNKLSFTLGVDVGWKINLQKFTDNELRSALTLKFKYT 

EFLEIYFSTLSINTKTFKYFKGYMDQIGLEPVNFFVDLSKSFNFFNSQDRKDSLFKIKKFSSGFKFNFYDWKFVGE 

Yl^EPDLIJ^GSDGIYSPIWRNNFTIYISV^FFAPIKASFE^^SrKDTNYEFIINI^ 

fl2 -nt 

ATGCGAGAATTCCTATACAGGAATGTTTTTAAAAAATCTTTTATAGTATTTTT^ 

CAATTTTTGCCCAGACTATAGATGATGAAAATTCTAAAAAAAGGGATAAGCTAACTTTAAGTCAAAAATCOT 
AAGAGAACTTGAGCTTTCAACCGATGAGGATTTAAAAAAATGGGCCTTAAAAGAGG^ 
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TABLE 1. Nucleotide and Amino Acid Sequences 

TCAAAAATACGAGAATTGCTTTTAAAAAAGTTTGGAATAGATCCTGAGCTTTTTATCAAAGGAAAGGGACTTGCCG 

GATCTGGTAGATATAAAATAATCATTGAAACTGCAGATAATCTTGAAAATTTCACTTATGGACTTACTAAAGATGA 

AAGTATTATTTTTGAAGGAAGAGTTAATATCTTGGTTGAAGATATTAAAGAAAATAAAAAGCACAATATTAAAC^ 

GACAGAATAGTCCTTAATAAGAACTCTAAAAAACTTTATGCTATTGGAAATGTTGAATATATTCT^^ 

CCAATGAAT^GCTTTATTTTTATGGCAATGAATTTCTTGTCGATTTTGATTCTCAAAATTTTTTATTAAAAAAT^ 

TATTCTTCAAAAAAAAATGCAAAAAAATCAAATAGATCATATTCTTTCGTTTGGAGGAAAGGTTTTAAAAAAGATA 

GACAATGATGTTACCATTTTGGAACAAGCTTTTGCAACAACTAGTAAAATTCCAGAGCCTTACTATTC 

CTTCTAAAATATGGGCATTGCCCTCGGGAGATTTTGGGTTTTTAAATGCCATATTTTACATGGGAAGAG^ 

ATTTTATATTCCTTTTTTTTTCAGACCGGGAGATAGTTTGTTTTTTAATCCATCTTTAGGTCTAAATCCACGAAAA 

GGTTTTTCTGTTTTTAATACCGTTTATCTTTTTGGTAATAAATCTTCAAGTGAAGATTCTTCTT 

ATTTCAATTCTGTTTATAATTCGGGTAAAAAACCTTATATAAGAAATGGATATTTAACTTATTTTTTTGCAGAAAA 

TTTAGCACCCAGTGTTAATAAAGATTATGTTAAGCTTATTTTTGACATTTATGCTAATCTGGGATTTTATTCTGGA 

ATTGATTTTAATTTGGGCAATACTTTGGGGCATTTTAAAACTTTGGAAGGAAATTTTGGATTGGGTTTTACCAGGA 

ATGTTTATAGTTACGATGGAGGATATTATCCTTTTGATAATAGGACTTTAAAACAATCTCTTTTTAGTTTTTCCAA 

TCTTAACAAAGGAGATGTATTTGGGTTTGAAGTTCCTTTTAGATATTTATTTAAATTTAAAACAGAATTTCTTTTA 

AGTGATGCACTTTTCTCGGTTGTTTTAGAGCACTATTCTGACCCGTATGTTAATATTGATTTTAGAGATAGGATAG 

AAAGTGCTACATTTTTTTCTCTTTTAAATTTAGATAAAGATTCGGTTAAAGAGCAAACTAGCATTAGCACTTTTGA 

TTGGAATTTATCTTCTTTTTATAAGCGAACATTTAATGACGGTTCGATTTTAGATTATAAATTAAATAATTTAGGT 

TTAAGTTTTAAATTGTCGGGCTATGAAAATCTTTATGTTAAATCTCCTTTAGAGAAACCAAAAGATGTTAATGATC 

CTACAAGAAAATGGTTTTATTTGGAGAGAATTTATGCTCCATATATTGATTTGAATTTTCAAAAAGATCTTTACAA 

TAACCAATGGACATTTCCAGCTGATACTAAAGAAATGATAATGCGCCCAGAAATTAAAAATCTAGAAGATAAAGAT 

AATGATAAAAAGAGTGTGAAGGAGAAAAATACTAAAAAAACAACAGAATTAACCAAAGATTTATATATTCCTCCAG 

AACCAATTACTTTAAAAAATATTGATCAATCCGATTCTTTTTTTATTAGGTTTGGCATTAATCCTTATTTAAGAAA 

TAATGTTTTTTTTGATAATTATGGCATAACAAGTCCAAAGGACTTTAATTATGAAATAAAAAATTATTTATTTC 

ATAAAAAATAAAACGGATATAAAAATTCATGCTGATTTTTACAATCGTTTAATTACTTTTGAAAATTTATTATATC 

TTAATACTATTGAGTATAGTCCTTTAAATAAAGATTTTAAAGTTGAAGATAAAGATAAAAAAAGTGAGCACTCTAT 

TATTAACCAAATAAATTTAAACTTGCTTCCTTTTATTAGATATCCTTTATTTTCTAGAAGTACTTTAAAGTTTGAA 

AATAAGGCTACTTTATATTCATTTAATAAAAAATATGATTCTGATGTAAAATCTTTGGTTAATAAGAATAGTAGTA 

TTTTTTTATCTGATCCGGAAACTTTTTATCAAAGTTTAACAGCCTCTTTAATTTATGATTATGATTATTTTACTAC 

TGAGCTTTCAGGTGAATTAAAAAATAGTTTTGAAGATATTAAAGCTTCTTCTGAGCTTAAACTTTCTTTAGATTTT 

CCTTATTTGCTACAAGAAGCTGGGATTGGAATTAAATATTATAAAAAGTTTAAAGAAGATGCTATGAAAAACTCTG 

GAATTTCTGCTGTTCAAAGTCCTTTGGAGCCTCAAAAACCATCATCGCCTTATAAAAATTTAGAAATGTCTCCTGC 

TTTGTATTATAAAATTGAGCCGAGATATTTGGATTATTTTAAATTTAGTTTTTTAGTCGCCTATGATCCTTTGATA 

AATAGAGTTTCTGAACTTTCTTTTAAGCTTAATGTTTTTGATTTTCAATTTTTGTTTGCTATGAAAGACGACTTTG 

AATATAATTATGATCCTTTAAAAGGAGATTTTTCCAAGATTGGTACTACAACCAAACTTGTTCCATATTCTTTAGA 

TTCTAGTTACAAAAAGGAATTGTACGTTTTAACTTTTTTTGACAATAAGCTTTCTTTTACCTTGGGGGTAGATGTT 

GGTTGGAAAATAAATTTGCAGAAATTTACGGATAATGAACTTCGATCTGCATTGACTTTGAAGTTTAAATATACAG 

AATTTTTAGAAATTTACTTTTCTACTTTATCTATTAATACTAAGACTTTTAAATATTTTAAAGGGTATATGGACCA 

AATTGGTCTAGAACCTGTTAATTTCTTTGTTGATTTATCAAAATCTTTCAATTTCTTTAATTCTCAAGACAGAAAA 

GATTCACTTTTTAAAATTAAAAAATTTTCATCAGGCTTTAAATTCAATTTTTATGATTGGAAATTTGTTGGAGAAT 

ATAATTTAGAACCAGATTTATTAAGGGGATCTGATGGGATTTATTCTCCTATTTGGAGAAATAATTTTACAATT^ 

TATTTCTTGGAACTTTTTTGCTCCTATAAAAGCGTCATTTGAAAACAACAAAGATACAAACTACGAGTTTATTAOT 

AATAGAAAAACAAAAAAATAA 

tl2.nt 

ATTTTTGCCCAGACTATAGATGATGAAAATTCTAAAAAAAGGGATAAGCTAACTTTAAGTCAAAAATCTTATTTAA 

GAGAACTTGAGCTTTCAACCGATGAGGATTTAAAAAAATGGGCCTTAAAAGAGGGTTTAAAAGAAACAGATGTTTC 

AAAAATACGAGAATTGCTTTTAAAAAAGTTTGGAATAGATCCTGAGCTTTTTATCAAAGGAAAGGGACTTGCCG 

TCTGGTAGATATAAAATAATCATTGAAACTGCAGATAATCTTGAAAATTTCACTTATGGACTTACTAAAGATGAAA 

GTATTATTTTTGAAGGAAGAGTTAATATCTTGGTTGAAGATATTAAAGAAAATAAAAAGCACAATA 

CAGAATAGTCCTTAATAAGAACTCTAAAAAACTTTATGCTATTGGAAATGTTGAATATATTCTTGATATC^ 

AATGAAAAGCTTTATTTTTATGGCAATGAATTTCTTGTCGATTTTGATTCTCAAAATTl^ 

TTCTTCAAAAAAAAATGCAAAAAAATCAAATAGATCATATTCTTTCGTTTGGAC^^ 

CAATGATGTTACCATTTTGGAACAAGCTTTTGCAACAACTAGTAAAATTC^ 

TCTAAAATATGGGCATTGCCCTCGGGAGATTTTGGGTTTTTAAATGCCAT^ 
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TABLE 1 . Nucleotide and Amino Acid Sequences 

TTTATATTCCTTTTTTTTTCAGACCGGGAGATAGTTTGTTTTTTAATCCATCTTTAGGTCTAAATCCACGAA^ 

TTTTTCTGTTTTTAATACCGTTTATCTTTTTGGTAATAAATCTTCAAGTGAAGATTCTTCTTTTTTGGATTTTGAT 

TTCAATTCTGTTTATAATTCGGGTAAAAAACCTTATATAAGAAATGGATATTTAACTTATTTTTTTGCAGAAAAO^ 

TAGCACCCAGTGTTAATAAAGATTATGTTAAGCTTATTTTTGACATTTATGCTAATCTGGGATTTTATTCTGGAAT 

TGATTTTAATTTGGGCAATACTTTGGGGCATTTTAAAACTTTGGAAGGAAATO 

GTTTATAGTTACGATGGAGGATATTATCCTTTTGATAATAGGACTTTAAAACAATCTCTTTTTAGTTTTTCCAATC 

TTAACAAAGGAGATGTATITGGGTTTGAAGTTCCTTTTAGATATTTATTTAAATTTAAAACAGAATTTCTTTTAA 

TGATGCACTTTTCTCGGTTGTTTTAGAGCACTATTCTGACCCGTATGTTAATATTGATTTTAGAGATAGGATAGAA 

AGTGCTACATTTTTTTCTCTTTTAAATTTAGATAAAGATTCGGTTAAAGAGCAAACTAGCATTAGCACTTTTGATT 

GGAATTTATCTTCTTTTTATAAGCGAACATTTAATGACGGTTCGATTTTAGATTATAAATTAAATAATTTAGGTTT 

AAGTTTTAAATTGTCGGGCTATGAAAATCTTTATGTTAAATCTCCTTTAGAGAAACCAAAAGATGTTAATGATCCT 

ACAAGAAAATGGTTTTATTTGGAGAGAATTTATGCTCCATATATTGATTTGAATTTTCAAAAAGATCTTTACAATA 

ACCAATGGACATTTCCAGCTGATACTAAAGAAATGATAATGCGCCCAGAAATTAAAAATCTAGAAGATAAAGATAA 

TGATAAAAAGAGTGTGAAGGAGAAAAATACTAAAAAAACAACAGAATTAACCAAAGATTTATATATTCCTCCAGAA 

CCAATTACTTTAAAAAATATTGATCAATCCGATTCTTTTTTTATTAGGTTTGGCATTAATCCTTATTTAAGAAATA 

ATGTTTTTTTTGATAATTATGGCATAACAAGTCCAAAGGACTTTAATTATGAAATAAAAAATTATTTATTTGATAT 

AAAAAATAAAACGGATATAAAAATTCATGCTGATTTTTACAATCGTTTAATTACTTTTGAAAATTTATTATATCTT 

AATACTATTGAGTATAGTCCTTTAAATAAAGATTTTAAAGTTGAAGATAAAGATAAAAAAAGTGAGCACTCTATTA 

TTAACCAAATAAATTTAAACTTGCTTCCTTTTATTAGATATCCTTTATTTTCTAGAAGTACTTTAAAGTTTGAAAA 

TAAGGCTACTTTATATTCATTTAATAAAAAATATGATTCTGATGTAAAATCTTTGGTTAATAAGAATAGTAGTATT 

TTTTTATCTGATCCGGAAACTTTTTATCAAAGTTTAACAGCCTCTTTAATTTATGATTATGATTATTTTACTACTG 

AGCTTTCAGGTGAATTAAAAAATAGTTTTGAAGATATTAAAGCTTCTTCTGAGCTTAAACTTTCTTTAGATTTTCC 

TTATTTGCTACAAGAAGCTGGGATTGGAATTAAATATTATAAAAAGTTTAAAGAAGATGCTATGAAAAACTCTGGA 

ATTTCTGCTGTTCAAAGTCCTTTGGAGCCTCAAAAACCATCATCGCCTTATAAAAATTTAGAAATGTCTCCTGCTT 

TGTATTATAAAATTGAGCCGAGATATTTGGATTATTTTAAATTTAGTTTTTTAGTCGCCTATGATCCTTTGATAAA 

TAGAGTTTCTGAACTTTCTTTTAAGCTTAATGTTTTTGATTTTCAATTTTTGTTTGCTATGAAAGACGACTTTGAA 

TATAATTATGATCCTTTAAAAGGAGATTTTTCCAAGATTGGTACTACAACCAAACTTGTTCCATATTCTTTAGATT 

CTAGTTACAAAAAGGAATTGTACGTTTTAACTTTTTTTGACAATAAGCTTTCTTTTACCTTGGGGGTAGATGTTGG 

TTGGAAAATAAATTTGCAGAAATTTACGGATAATGAACTTCGATCTGCATTGACTTTGAAGTTTAAATATACAGAA 

TTTTTAGAAATTTACTTTTCTACTTTATCTATTAATACTAAGACTTTTAAATATTTTAAAGGGTATATGGACCAAA 

TTGGTCTAGAACCTGTTAATTTCTTTGTTGATTTATCAAAATCTTTCAATTTCTTTAATTCTCAAGACAGAAAAGA 

TTCACTTTTTAAAATTAAAAAATTTTCATCAGGCTTTAAATTCAATTTTTATGATTGGAAATTTGTTGGAGAATAT 

AATTTAGAACCAGATTTATTAAGGGGATCTGATGGGATTTATTCTCCTATTTGGAGAAATAATTTTACAATTTATA 

TTTCTTGGAACTTTTTTGCTCCTATAAAAGCGTCATTTGAAAACAACAAAGATACAAACTACGAGTTTATTATTAA 

TAGAAAAACAAAAAAATAA 

fl29.aa 

MTKKLFVRVLIFLISNNYAFAKDTIKDLFFIQDILIKKEKYSEVLNNASLEGIIEIEHNGPYIKDHDSEVK^ 

NGYRimFNFFNLLNTSNIIKSLSLFDSRPKNIKENEIILLETKMIKENPYKRYKDDDDFELKLSVTRKN^ 

DFNFLFDQRKTFPSIYIKEEDVSTIINSFMKLQDSSFLSPQAS 

tl29.aa 

KDTIKDLFFIQDILIKKEKYSEVIiNNASLEGIIEIEHNGPYIKDHDSEVKLILKENGYRRNFNFFNLLNTSNIIKS 
LSLFDSRPKNIKENEIILLETKMIKENPYKRYKDDDDFELKLSVTRKNNQIYLILDFNFLFDQRKTFPSIYIKEED 
VSTIINSFMKLQDSSFLSPQAS 

fl29.nt 

ATGACAAAAAAATTGTTTGTGAGGGTATTAATCTTTTTAATATCCAAT/^TTATGCTTTTGCAAAAGACACAATCA 

AAGATTTGTTCTTTATACAAGATATACTAATAAAAAAAGAGAAATATTCCGAGGTTCTAAATAATGCAAGCCTTGA 

AGGCATTATTGAAATTGAACATAACGGACCATACATTAAAGATCACGATTCAGAAGTTAAACTTATCCTAAAAGAA 

AACGGATATAGAAGAAATTTCAACTTTTTTAATCTTTTAAATACTAGTAATATAATCAAAAGTCTAAGCTTAT^ 

ACAGCAGACCAAAAAACATTAAAGAAAATGAAATCATATTATTAGAGACAAAAATGATTAAAGAAAATCCCTATAA 

ACGATACAAAGACGATGATGATTTTGAATTAAAACTAAGTGTAACTCGAAAAAATAATCAAATTTATTTAAT^ 
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GATTTCAATTTCCTATTTGATCAAAGAAAAACGTTTCCATCAATTTACATCAAAGAAGAAGATGTATCAACAATAA 
TAAACAGCTTCATGAAACTACAAGATTCAAGCTTTTTATCTCCTCAAGCTTCTTAA 

tl29.nt 

AAAGACACAATCAAAGATTTGTTCTTTATACAAGATATACTAATAAAAAAAGAGAAATATTCCGAGGTTCTAAATA 
ATGCAAGCCTTGAAGGCATTATTGAAATTGAACATAACGGACCATACATTAAAGATCACGATTCAGAAGTTAAACT 
TATCCTAAAAGAAAACGGATATAGAAGAAATTTCT^CTTTTTTAATCTTTTAAATACTAGTAATATAATCAAAAGT 
CTAAGCTTATTTGACAGCAGACCAAAAAACATTAAAGAAAATGAAATCATATTATTAGAGACAAAAATGATTAAAG 
AAAATCCCTATAAACGATACAAAGACGATGATGATTTTGAATTAAAACTAAGTGTAACTCGAAAAAATAATCAAAT 
TTATTTAATTCTTGATTTCAATTTCCTATTTGATCAAAGAAAAACGTTTCCATCAATTTACATCAAAGAAGAAGAT 
GTATCAACAATAATAAACAGCTTCATGAAACTACAAGATTCAAGCTTTTTATCTCCTCAAGCTTCTTAA 

fl42.aa 

MDKISILYTLINIIIMLILISIWLCKRKWSFTKRVFIALAIGIVFGMTIQYFYGTNSEITNETINWISILGDG 

VRLLKMI IIPLIITSIISAII KLTNS KDVGKMS LLVI LTL VFTAG I AAI I G I FTALALGLTAEGLQAGT I E I LQS E 

KLQKGLEILNQTTITKKITDLIPQNIFEDFAGLRKNSTIGWIFSAIIGIAALKTSIKKPESIEFFKKIILTLQDI 

ILGVWLILKLTPYAILAIJyiTKITATSEIKSIIKLGEFVIASYIAIGLTFLMHMTLIAINKLNPITFIKKIFPALS 

FAFISRSSAATIPINIEIQTKNLGVSEGIANLSSSFGTSIGQNGCAALHPAMLAIMIAPTQGINPTDISFILTLIG 

L 1 1 ITSFGAAGAGGGATTAS LMVLSAMNF PVGLVGLVI S VEP 1 1 DMGRTAVNVGGSMLAGVI SAKQLKQFNHNI YW 

QKELVNK 

tl42.aa 

CKRKNVSFTKRVFIALAIGIVFGMTIQYFYGTNSEITNETINWISILGDGYVRLLKMIIIPLIITSIISAIIKLTN 
SKDVGKMSLLVILTLVFTAGIAAIIGIFTALALGLTAEGLQAGTIEILQSEKLQKGLEILNQTTITKKITDLIPQN 
IFEDFAGLRKNSTIGWIFSAIIGIAALKTSIKKPESIEFFKKIILTLQDIILGWTLILKLTPYAILALMTKITA 
TSEIKSIIKLGEFVIASYIAIGLTFLMHMTLIAINKLNPITFIKKIFPALSFAFISRSSAATIPINIEIQTKNLGV 
SEGIANLSSSFGTSIGQNGCAALHPAMLAIMIAPTQGINPTDISFILTLIGLIIITSFGAAGAGGGATTASLMVLS 
AMNFPVGLVGLVI SVEPI IDMGRTAVNVGGSMLAGVI SAKQLKQFNHNI YNQKELVNK 



fl42.nt 

TAAGAGGTAATAATGGATAAAATAAGTATATTATATACATTAATCAATATTATAATAATGCTTATTCTAATAAGCA 

TAGTTTATCTTTGTAAAAGAAAAAATGTTTCTTTTACAAAAAGAGTGTTTATAGCGTTAGCAATCGGAATAGTATT 

TGGAATGACCATTCAATATTTTTATGGAACAAATTCAGAAATAACAAACGAAACTATAAATTGGATAAGTATTTTG 

GGCGATGGATACGTAAGGCTCCTTAAAATGATTATAATCCCCTTAATAATAACATCAATAATCTCTGCAATAATAA 

AACTAACCAATAGTAAAGATGTTGGGAAAATGAGCCTACTTGTAATATTAACACTAGTATTTACAGCAGGTATTGC 

TGCCATAATTGGCATTTTCACTGCTTTAGCATTGGGATTAACAGCCGAAGGACTACAAGCGGGAACCATCGAAATT 

TTACAAAGTGAAAAATTGCAAAAAGGCCTTGAAATATTAAATCAAACAACAATCACAAAAAAAATCACAGATCTTA 

TTCCACAAAATATATTTGAAGATTTTGCAGGGCTTAGAAAAAACTCAACCATCGGGGTCGTGATATTTTCAGCTAT 

CATAGGAATAGCCGCCCTTAAAACATCTATCAAAAAGCCAGAATCAATAGAATTTTTTAAAT^AAATAATATTAACA 

CTCCAAGACATAATATTAGGTGTAGTAACTTTGATTTTAAAACTAACGCCTTATGCTATATTAGCTTTAATGACAA 

AAATTACAGCAACCAGCGAAATCAAAAGCATAATAAAGCTTGGAGAATTTGTAATTGCTTCCTACATTGCCATAGG 

TCTTACATTTCTTATGCATATGACATTAATTGCAATAAATAAATTAAACCCAATTACTTTTATAAAAAAAATATTC 

CCAGCACTATCATTTGCATTCATATCTAGGTCGAGTGCTGCAACCATACCCATTAATATAGAAATTCAAACTAAAA 

ATCTGGGAGTAAGCGAAGGAATAGCAAATTTATCAAGCTCCTTTGGAACATCAATTGGGCAAAATGGTTGTG 

ACTACACCCCGCTATGCTTGCAATAATGATAGCACCAACTCAGGGAATAAACCCCACAGATATTTCATTTATACTC 

ACACTTATTGGATTAATAATAATAACTTCATTTGGAGCTGCTGGCGCTGGTGGAGGCGCAACA^ 

TGGTGCTCTCAGCAATGAACTTTCCAGTGGGATTGGTAGGACTTGTAATATCTGTTGAGCCTATAATTGACATGGG 

AAGAACAGCTGTTAATGTAGGCGGCTCAATGCTTGCAGGCGTTATATCTGCTAAACAGCTCAAACAATTCA^ 

AATATATACAACCAAAAAGAGCTTGTAAACAAATAA 



tl42 .nt 
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TGTAAAAGAAAAAATGTTTCTTTTACAAAAAGAGTGTTTATAGCGTTAGCAATCGGAATAGTATTTGGAATGACCA 
TTCAATATTTTTATGGAACAAATTCAGAAATAACAAACGAAACTATAAATTGGATAAGTATTTTGGGCGATGGATA 
CGTAAGGCTCCTTAAAATGATTATAATCCCCTTAATAATAACATCAATAATCTCTGCAATAATAAAACTAACCAAT 
AGTAAAGATGTTGGGAAAATGAGCCTACTTGTAATATTAACACTAGTATTTACAGCAGGTATTGCTGCCATAATTG 
GCATTTTCACTGCTTTAGCATTGGGATTAACAGCCGAAGGACTACAAGCGGGAACCATCGAAATTTTACAAAGTGA 
AAAATTGCAAAAAGGCCTTGAAATATTAAATCAAACAACAATCACAAAAAAAATCACAGATCTTATTCCACAAAAT 
ATATTTGAAGATTTTGCAGGGCTTAGAAAAAACTCAACCATCGGGGTCGTGATATTTTCAGCTATCATAGGAATAG 
CCGCCCTTAAAACATCTATCAAAAAGCCAGAATCAATAGAATTTTTTAAAAAAATAATATTAACACTCCAAGACAT 
AATATTAGGTGTAGTAACTTTGATTTTAAAACTAACGCCTTATGCTATATTAGCTTTAATGACAAAAATTACAGCA 
ACCAGCGAAATCAAAAGCATAATAAAGCTTGGAGAATTTGTAATTGCTTCCTACATTGCCATAGGTCTTACATTTC 
TTATGCATATGACATTAATTGCAATAAATAAATTAAACCCAATTACTTTTATAAAAAAAATATTCCCAGCACTATC 
ATTTGCATTCATATCTAGGTCGAGTGCTGCAACCATACCCATTAATATAGAAATTCAAACTAAAAATCTGGGAGTA 
AGCGAAGGAATAGCAAATTTATCAAGCTCCTTTGGAACATCAATTGGGCAAAATGGTTGTGCAGCACTACACCCCG 
CTATGCTTGCAATAATGATAGCACCAACTCAGGGAATAAACCCCACAGATATTTCATTTATACTCACACTTATTGG 
ATTAATAATAATAACTTCATTTGGAGCTGCTGGCGCTGGTGGAGGCGCAACAACAGCCTCACTAATGGTGCTCTCA 
GCAATGAACTTTCCAGTGGGATTGGTAGGACTTGTAATATCTGTTGAGCCTATAATTGACATGGGAAGAACAGCTG 
TTAATGTAGGCGGCTCAATGCTTGCAGGCGTTATATCTGCTAAACAGCTCAAACAATTCAACCATAATATATACAA 
CCAAAAAGAGCTTGTAAACA7VATAA 

fl47 .aa 

MKIIIIGGTSAGTSAAAKANRLNKKLDITIYEKTNIVSFGTCGLPYFVGGFFDNPNTMISRTQEEFEKTGISVKTN 
HEVIK\mAKNNTIVIKNQKTGTIFNNTYDQLMIATGAKPIIPPINNINLENFHTLKNLEDGQKIKKLMDREEIKNI 
VIIGGGYIGIEMVEAAKNKRKNVRLIQLDKHILIDSFDEEIVTIMEEELTKKGVNLHTNEFVKSLIGEKKAEGVVT 
NKNTYQADAVILATGIKPDTEFLENQLKTTKNGAIIVNEYGETSIKNIFSAGDCATIYNIVSKKNEYIPLATTANK 
LGRIVGENLAGNHTAFKGTLGSASIKILSLEAARTGLTEKDAKKLQIKYKTIFVKDKNHTNYYPGQEDLYIKLIYE 
ENTKIILGAQAIGKNGAVIRIHALSIAIYSKLTTKELGMMDFSYSPPFSRTWDILNIAGNAAK 

tl47 .aa 

AAAKANRLNKKLDITIYEKTNIVSFGTCGLPYFVGGFFDNPNTMISRTQEEFEKTGISVKTNHEVIKVDAKNNTIV 

IKNQKTGTIFNNTYIX^IiMIATGAKPIIPPINNINLENFHTLKNLEIXSQKIKKLMDREEIKNIVIIGG 

AAKNKRKNVRLIQLDKHILIDSFDEEIWIMEEELTKKGVNLHTNEFVKSLIGEKKAEGWTNKNTYQ^ 

GIKPDTEFLENQLKTTKNGAIIVNEYGETSIKNIFSAGDCATIYNIVSKKNEYIPIJ^TTANKLGRIVGENLAGNHT 

AFKGTLGSASIKILSLEAARTGLTEKDAKKLQIKYKTIFVKDKNHTNYYPGQEDLYIKLIYEENTKIILGAQAIGK 

NGAVIRIHALSIAIYSKLTTKELGMMDFSYSPPFSRTWDILNIAGNAAK 

fl47 .nt 

ATGAAAATAATAATTATTGGGGGCACATCAGCAGGAACTAGTGCCGCAGCTAAAGCAAACCGCTTAAACAAAAAGC 

TAGACATTACTATCTATGAAAAAACAAATATTGTATCTTTTGGAACCTGTGGCCTGCCTTACTTTGTGGGGGGATT 

CTTTGACAACCCCAATACAATGATCTCAAGAACACAAGAAGAATTCGAAAAAACTGGAATCTCTGTTAAAACTAAC 

CACGAAGTTATCAAAGTAGATGCAAAAAACAATACAATTGTAATAAAAAATCAAAAAACAGGAACCATTTTTAACA 

ATACTTACGATCAACTTATGATAGCAACTGGTGCAAAACCTATTATTCCACCAATCAATAATATCAATCTAGAAAA 

TTTTCATACTCTGAAAAATTTAGAAGACGGTCAAAAAATAAAAAAATTAATGGATAGAGAAGAGATTAAAAATATA 

GTGATAATTGGTGGTGGATACATTGGAATTGAAATGGTAGAAGCAGCAAAAAATAAAAGAAAAAATGTAAGATTAA 

TTCAACTAGATAAGCACATACTCATAGATTCCTTTGACGAAGAAATAGTCACAATAATGGAAGAAGAACTAACAAA 

AAAGGGGGTTAATCTTCATACAAATGAGTTTGTAAAAAGTTTAATAGGAGAAAAAAAGGCAGAAGGAGTAGTAACA 

AACAAAAATACTTATCAAGCTGACGCTGTTATACTTGCTACCGGAATAAAACCTGACACTGAATTTTTAGAAAACC 

AGCTTAAAACTACTAAAAATGGAGCAATAATTGTAAATGAGTATGGCGAAACTAGCATAAAAAATATTTTTT^ 

AGGAGATTGTGCAACTATTTATAATATAGTAAGTAAAAAAAATGAATACATACCCTTGGCAACAACAGCCAACAAA 

CTTGGAAGAATAGTTGGTGAAAATTTAGCTGGGAATCATACAGCATTTAAAGGCACATTGGGCTCAGCTTCAATTA 

AAATACTATCTTTAGAAGCTGCAAGAACAGGACTTACAGAAAAAGATGCAAAAAAGCTCCAAATAAAATATAAAAC 

GATTTTTGTAAAGGACAAAAATCATACAAATTATTATCCAGGCCAAGAAGATCTTTATATTAAATT^ 

GAAAATACCAAAATAATCCTTGGGGCACAAGCAATAGGAAAAAATGGAGCCGTAATAAGAATTCATGCTTTATCAA 
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TTGCAATCTATTCAAAACTTACAACAAAAGAGCTAGGGATGATGGATTTCTCATATTCCCCACCCTTCTCAA 
TTGGGATATATTAAATATTGCTGGCAATGCTGCCAAATAG 

tl47 .nt 

GCCGCAGCTAAAGCAAACCGCTTAAACAAAAAGCTAGACATTACTATCTATGAAAAAACAAATATTGTATCTTTTG 
GAACCTGTGGCCTGCCTTACTTTGTGGGGGGATTCTTTGACAACCCCAATACAATGATCTCAAGAACACAAGAAGA 
ATTCGAAAAAACTGGAATCTCTGTTAAAACTAACCACGAAGTTATCAAAGTAGATGCAAAAAACAATACAATTGTA 
ATAAAAAATCAAAAAACAGGAACCATTTTTAACAATACTTACGATCAACTTATGATAGCAACTGGTGCAAAACCTA 
TTATTCCACCAATCAATAATATCAATCTAGAAAATTTTCATACTCTGAAAAATTTAGAAGACGGTCAAAAAATAAA 
AAAATTAATGGATAGAGAAGAGATTAAAAATATAGTGATAATTGGTGGTGGATACATTGGAATTGAAATGGTAGAA 
GCAGCAAAAAATAAAAGAAAAAATGTAAGATTAATTCAACTAGATAAGCACATACTCATAGATTCCTTTGACGAAG 
AAATAGTCACAATAATGGAAGAAGAACTAACAAAAAAGGGGGTTAATCTTCATACAAATGAGTTTGTAAAAAGTTT 
AATAGGAGAAAAAAAGGCAGAAGGAGTAGTAACAAACAAAAATACTTATCAAGCTGACGCTGTTATACTTGCTACC 
GGAATAAAACCTGACACTGAATTTTTAGAAAACCAGCTTAAAACTACTAAAAATGGAGCAATAATTGTAAATGAGT 
ATGGCGA/^CTAGCATAAAAAATATTTTTTCTGCAGGAGATTGTGCAACTATTTATAATATAGTAAGTAAAAAAAA 
TGAATACATACCCTTGGCAACAACAGCCAACAAACTTGGAAGAATAGTTGGTGAAAATTTAGCTGGGAATCATACA 
GCATTTAAAGGCACATTGGGCTCAGCTTCAATTAAAATACTATCTTTAGAAGCTGCAAGAACAGGACTTACAGAAA 
AAGATGCAAAAAAGCTCCAAATAAAATATAAAACGATTTTTGTAAAGGACAAAAATCATACAAATTATTATCCAGG 
CCAAGAAGATCTTTATATTAAATTAATTTATGAGGAAAATACCAAAATAATCCTTGGGGCACAAGCAATAGGAAAA 
AATGGAGCCGTAATAAGAATTCATGCTTTATCAATTGCAATCTATTCAAAACTTACAACAAAAGAGCTAGGGATGA 
TGGATTTCTCATATTCCCCACCCTTCTCAAGAACTTGGGATATATTAAATATTGCTGGCAATGCTGCCAAATAG 

fl52 .aa 

MLKFEFSDRFLLFSYFVLIMFIGSLLLMLPISWEGDGKLAYIDALFTAVSAVSITGLTTVKMEGFSTFGFILIMLL 
IQLGGLGFISITTFYLLIPKKKMNLTDARIIKQYSLSNIEYNPIRILKSILFITFSIEMIGLILILICFKLRGVNI 
SFLEALFTTISAFCNAGFSMHSESIYAWRDVPEAIVWSILIICGGLGFMVYRDVNNTIKNKKKLSLHAKIVFSLS 
FFLIIIGAILFFFTEMHKLKAGYSMSTLIFNSIFYSISTRTAGFNYLDNSLISGRTQIISLPFMFIGGAPGSTAGG 
IKITTFFLIVIJVVVICNQNGNGYIIGSYKA^SIDSIRFALLFFARAIFILSFSFFMLLFFEGGSGNWKVIDLGYEVFS 
AFGTVGLSVGVTQDLSFWGKVIIIFTMFAGRIGLFSMAVFVSRKSRFEEFTRPRQDILVG 

tl52 .aa 

WEGDGKLAYIDALFTAVSAVSITGLTTVKMEGFSTFGFILIMLLIQLGGLGFISITTFYLLIPKKKMNLTDARIIK 

QYSLSNIEYNPIRILKSILFITFSIEMIGLILILICFKLRGVNISFLEALFTTISAFCNAGFSMHSESIYAWRDVP 

EAIXAA/'SILIICGGLGFMVYRDVNNTIKNKKKLSLHAKIVFSLSFFLIIIGAILFFFTEMHKLKAGYS 

IFYSISTRTAGFNYLDNSLISGRTQIISLPFMFIGGAPGSTAGGIKITTFFLIVLAWKNQNGNGYIIGSYKVSID 

SIRFALLFFARAIFILSFSFFMLLFFEGGSGNWKVIDLGYEVFSAFGTVGLSVGVTQDLSFWGKVIIIFTMFAGRI 

GLFSMAVFVSRKSRFEEFTRPRQDILVG 

fl52.nt 

ATGTTGAAATTTGAATTTAGCGACAGGTTTTTACTTTTTAGTTATTTTGTTTTAATTATGTTTATAGGCTCTCTTT 

TGTTGATGTTGCCTATTTCCTGGGAAGGTGATGGCAAATTAGCATACATTGATGCTCTTTTTACTGCTGTTTCTGC 

TGTAAGTATTACGGGCCTTACAACGGTTAAAATGGAAGGCTTTTCTACTTTTGGATTTATTTTGATAATGTTGCTA 

ATCCAGCTTGGGGGACTTGGATTTATAAGTATTACTACTTTTTATTTGCTTATACCTAAAAAGAAAATGAATTTAA 

CAGATGCAAGAATAATAAAGCAGTATTCCCTTTCAAATATAGAATATAATCCTATTAGAATTTTAAAAAGCATATT 

GTTTATAACTTTTTCAATTGAAATGATAGGTTTAATATTAATACTTATTTGTTTTAAACTTAGGGGAGTGAATATT 

TCATTCTTAGAGGCTTTGTTTACGACAATTTCTGCTTTTTGCAATGCAGGTTTTTCCATGCATTCTGAGAGTATTT 

ATGCATGGCGAGATGTTCCTGAAGCTATAGTTGTGGTCTCTATTTTAATAATTTGTGGTGGGCTTGGGTTTATGGT 

CTATAGAGATGTAAATAACACTATTAAAAACAAAAAAAAACTATCGCTTCATGCCAAGATAGTTTTTTCTTTAAGC 

TTCTTTTTAATTATAATTGGTGCAATTTTATTTTTTTTTACAGAGATGCATAAATTAAAAGCTGGTTATTCAATGA 

GCACTTTAATATTTAATTCAATTTTTTATTCGATTAGTACCAGAACAGCTGGTTTTAATTATCTTGATAATTCT^ 

AATAAGCGGAAGAACTCAAATAATTTCTCTACCATTCATGTTTATTGGTGGTGCACCCGG^ 

ATTAAGATTACAACATTTTTTTTAATTGTATTGGCTGTTGTTAAAAATCAAAACG^ 
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CTTACAAGGTTTCAATAGATAGTATAAGATTTGCACTTTTATTTTTTGCAAGAGCTATTTTTATTTTAAGTTTTTC 

ipipYTTTCATGCTTCTTTTTTTTGAGGGAGGATCTGGCAATTGGAAGGTTATTGATTTAGGTTATC 

GCTTTTGGAACGGTTGGTCTTTCAGTTGGAGTAACTCAGGATTTGTCATTTTGGGGGAAAGTCATTATAATTTTTA 

CTATGTTTGCAGGACGAATAGGGCTTTTTTCAATGGCTGTTTTTGTTTCAAGAAAGTCGCGTTTTGAAGAATTTAC 

AAGGCCAAGGCAAGATATTTTGGTTGGTTGA 

tl52 .nt 

TGGGAAGGTGATGGCAAATTAGCATACATTGATGCTCTTTTTACTGCTGTTTCTGCTGTAAGTATTACGGGCCTTA 
CAACGGTTAAAATGGAAGGCTTTTCTACTTTTGGATTTATTTTGATAATGTTGCTAATCCAGCTTGGGGGACTTGG 
ATTTATAAGTATTACTACTTTTTATTTGCTTATACCTAAAAAGAAAATGAATTTAACAGATGCAAGAATAATAAAG 
CAGTATTCCCTTTCAAATATAGAATATAATCCTATTAGAATTTTAAAAAGCATATTGTTTATAACTTTTTCAATTG 
AAATGATAGGTTTAATATTAATACTTATTTGTTTTAAACTTAGGGGAGTGAATATTTCATTCTTAGAGGCTTTGTT 
TACGACAATTTCTGCTTTTTGCAATGCAGGTTTTTCCATGCATTCTGAGAGTATTTATGCATGGCGAGATGTTCCT 
GAAGCTATAGTTGTGGTCTCTATTTTAATAATTTGTGGTGGGCTTGGGTTTATGGTCTATAGAGATGTAAATAACA 
CTATTAAAAACAAAAAAAAACTATCGCTTCATGCCAAGATAGTTTTTTCTTTAAGCTTCTTTTTAATTATAATTGG 
TGCAATTTTATTTTTTTTTACAGAGATGCATAAATTAAAAGCTGGTTATTCAATGAGCACTTTAATATTTAATTCA 
ATTTTTTATTCGATTAGTACCAGAACAGCTGGTTTTAATTATCTTGATAATTCTTTAATAAGCGGAAGAACTCAAA 
TAATTTCTCTACCATTCATGTTTATTGGTGGTGCACCCGGATCAACTGCAGGAGGGATTAAGATTACAACATTTTT 
TTTAATTGTATTGGCTGTTGTTAAAAATCAAAACGGCAATGGATATATTATTGGTTCTTACAAGGTTTCAATAGAT 
AGTATAAGATTTGCACTTTTATTTTTTGCAAGAGCTATTTTTATTTTAAGTTTTTCTTTTTTCATGCTTCTTTTTT 
TTGAGGGAGGATCTGGCAATTGGAAGGTTATTGATTTAGGTTATGAAGTATTTTCTGCTTTTGGAACGGTTGGTCT 
TTCAGTTGGAGTAACTCAGGATTTGTCATTTTGGGGGAAAGTCATTATAATTTTTACTATGTTTGCAGGACGAATA 
GGGCTTTTTTCAATGGCTGTTTTTGTTTCAAGAAAGTCGCGTTTTGAAGAATTTACAAGGCCAAGGCAAGATATTT 
TGGTTGGTTGA 

fl54.aa 



MKINKTFILLFLFTKFSFVQAQANQILTEISPLSILSKNGKGSVYLKVSKSSDYILTLDKSSNSDFVFKIYDISNK 

KYITDKVKRRDFKIRLDKNSLYAIIYVGTKNENIKFSLTDLDFSILSSDSLKAKTSKIEKEDLFFTLKDLPVLNLT 

AKLKKYVLRIYKSNIYIAYQLENSDDIKVAEFIEDVGWFNLDSSVNRNITNIVNFDFSINSKGNLYIAFVTKSGAD 

FASELIVKKFNSRKWIDISPGHIENFGSLLNISIDLKDRLYLAYLREIRGEYKINLISNMGYGSIWTDVIHAYLSK 

GDSNVNSSNIGLISEPFLGIFYlSn^KSNNEIKSEFIVNNENAWVNANIPSVYMA^ 

PIVNICPLKSSRWINISPWEMEGLSADIGLYKlSn^FLAFEDNNNVRLIYFKN^ 

YGNQGLVISTLSSNSNELFFTLICQ 

tl54.aa 

NQILTEISPLSILSKNGKGSVYLKVSKSSDYILTLDKSSNSDFVFKIYDISNKKYITDKVKRRDFKIRLDKNSLYA 

IIYVGTKNENIKFSLTDLDFSILSSDSLKAKTSKIEKEDLFFTLKDLPVLNLTAKLKKYVLRIYKSNIYIAYQLEN 

SDDIKVAEFIEDVGWFNLDSSVNRNITNIVNFDFSINSKGNLYIAFVTKSGADFASELIVKKFNSRKWIDIS^^ 

ENFGSLLNISIDLKDRLYLAYLREIRGEYKINLISNMGYGSIWTDVIHAYLSKGDSNVNSSNIGLISEPFLGIFYN 

YKSNNEIKSEFIVNNENAWVNANIPSVYMANFIKGFFDSNFNQIIMSFVSENRPIVNICPLKSSRWINISPNVE^ 

GLSADIGLYKNNLFLAFEDNNNVRLIYFKNKNWY 

CQ 

fl54 .nt 



ATGAAAATAAATAAGACATTCATTTTGCTATTTTTATTTACAAAATTTTCTTTTGTTCAAGCTCAAGCAAATCAAA 

TATTAACAGAAATTAGTCCTTTAAGTATTTTAAGCAAAAATGGGAAAGGAAGTGTTTACTTAAAAGTTAGCA 

TTCCGATTATATTTTAACCCTAGATAAGAGTTCAAATTCCGATTTTGTTTTTAAAATTTATGACATO 

AAATATATAACCGATAAAGTAAAAAGAAGAGATTTTAAAATAAGATTAGATAAAAATTCTCTTTATGCAATAATAT 

ATGTTGGTACTAAAAATGAAAACATAAAGTTTTCGCTTACAGATTTAGATT^ 

GAAAGCTAAAACATCTAAGATTGAAAAAGAAGATTTATTTTTTACTTTAAAAGATTTGCCTC 
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GCCAAGCTTAAAAAATATGTATTAAGGATTTATAAAAGCAATATTTATATTGCTTATCAGCTAGAAAATAGCGATG 

ATATTAAAGTTGCTGAATTTATTGAGGATGTTGGTTGGTTTAATCTTGATTCATCTGTTAATAGi^^ 

TATAGTTAATTTTGATTTTTCAATTAATTCTAAAGGAAATTTATATATTGCTTTTGTTACGAAATCAGGGGCTGAT 

TTTGCCAGCGAGCTTATAGTTAAAAAATTTAATAGTAGAAAATGGATTGATATTAGTCCTGGTCACATAGAAAATT 

TTGGATCTTTATTAAATATTAGCATTGATTTAAAAGATAGGTTGTATTTAGCATATTTAAGGGAAATTAGGGGTGA 

ATATAAAATTAATTTAATCTCGAATATGGGTTACGGAAGTATTTGGACCGATGTAATACATGCTTATTTAAGTAAA 

GGTGATTCTAATGTTAATTCATCAAACATTGGTTTAATATCTGAACCTTTTTTGGGCATTTTTTATAATTATAAGT 

CAAATAATGAGATTAAATCTGAATTTATTGTAAACAATGAAAATGCTTGGGTAAATGCAAATATTCCTTCTGTTTA 

TATGGCCAATTTTATTAAAGGCTTTTTTGATTCTAATTTTAATCAAATAATTATGAGTTTTGTTTCTGAAAATAGA 

CCTATTGTAAACATTTGTCCTTTGAAAAGTAGTAGATGGATTAATATAAGTCCTAATGTTGAAATGGAAGGTTTAA 

GTGCTGACATTGGGCTTTATAAAAATAATTTGTTTTTAGCTTTTGAGGACAATAATAATGTGAGATTAATTTATTT 

TAAGAATAAAAATTGGTATTTTTTAAATAAGCTTGAGAATTTTAAGAGTAATGTTAAAAGCCCTCAGATTGGAATT 

TATGGCAATCAAGGGCTTGTAATCTCTACTTTAAGCTCTAATTCCAATGAATTATTTTTTACTTTGATTTGCCAAT 

GA 

tl54 .nt 



AATCAAATATTAACAGAAATTAGTCCTTTAAGTATTTTAAGCAAAAATGGGAAAGGAAGTGTTTACTTAAAAGTTA 
GCAAATCTTCCGATTATATTTTAACCCTAGATAAGAGTTCAAATTCCGATTTTGTTTTTAAAATTTATGACATTTC 
TAATAAAAAATATATAACCGATAAAGTAAAAAGAAGAGATTTTAAAATAAGATTAGATAAAAATTCTCTTTATGCA 
ATAATATATGTTGGTACTAAAAATGAAAACATAAAGTTTTCGCTTACAGATTTAGATTTTTCAATTTTAAGTAGCG 
ATTCCCTGAAAGCTAAAACATCTAAGATTGAAAAAGAAGATTTATTTTTTACTTTAAAAGATTTGCCTGTTTTAAA 
TTTAACTGCCAAGCTTAAAAAATATGTATTAAGGATTTATAAAAGCAATATTTATATTGCTTATCAGCTAGAAAAT 
AGCGATGATATTAAAGTTGCTGAATTTATTGAGGATGTTGGTTGGTTTAATCTTGATTCATCTGTTAATAGAAATA 
TTACTAATATAGTTAATTTTGATTTTTCAATTAATTCTAAAGGAAATTTATATATTGCTTTTGTTACGAAATCAGG 
GGCTGATTTTGCCAGCGAGCTTATAGTTAAAAAATTTAATAGTAGAAAATGGATTGATATTAGTCCTGGTCACATA 
GAAAATTTTGGATCTTTATTAAATATTAGCATTGATTTAAAAGATAGGTTGTATTTAGCATATTTAAGGGAAATTA 
GGGGTGAATATAAAATTAATTTAATCTCGAATATGGGTTACGGAAGTATTTGGACCGATGTAATACATGCTTATTT 
AAGTAAAGGTGATTCTAATGTTAATTCATCAAACATTGGTTTAATATCTGAACCTTTTTTGGGCATTTTTTATAAT 
TATAAGTCAAATAATGAGATTAAATCTGAATTTATTGTAAACAATGAAAATGCTTGGGTA7VATGCAAATATTCCTT 
CTGTTTATATGGCCAATTTTATTAAAGGCTTTTTTGATTCTAATTTTAATCAAATAATTATGAGTTTTGTTTCTGA 
AAATAGACCTATTGTATU^CATTTGTCCTTTGAAAAGTAGTAGATGGATTAATATAAGTCCTAATGTTGAAATGG^ 
GGTTTAAGTGCTGACATTGGGCTTTATAAAAATAATTTGTTTTTAGCTTTTGAGGACAATAATAATGTGAGATTAA 
TTTATTTTAAGAATAAAAATTGGTATTTTTTAAATAAGCTTGAGAATTTTAAGAGTAATGTTAAAAGCCCTCAGAT 
TGGAATTTATGGCAATCAAGGGCTTGTAATCTCTACTTTAAGCTCTAATTCCAATGAATTATTTTTTACTTTGATT 
TGCCT^TGA 



fl57 .aa 



MKIFLKVIGRGILGRIiMVFRKNYDYLALISLLIVSFVGILLIYSSDYNISGSLTKNEYIKQTFWVIIGFFLIFIVG 
KYDLKF VYSM\n" PL YFLL I I^L I FTAF FGMTWGARS WI G I WKLGGQPS EFGKW 

ITAFLLIFPSVILILLQPDFGTAIVYLTIFIFISFFAGIDLHYVLAFALIGFFSFVFAILPVWYEYKV^^^GNVF 
IFSNPFYFRVIMGVLLLILLISVLGFFISKYGLSIKIIYFYVFFASSILLVSIVFSKVLSKLMKTYQIKRFLVFLD 
PAIDAKGAGWNLNQVKIAIGSGGLLGKGFLKGPYTHANYVPSQSTDFIFSILAEEFGFLGVSTILILFFFLFFKFL 
IIMNKSQDRYMALVISGILGLLFFHTSFNVGMSU3VLPITGIPFPFLSYGGSSTITFFLAMSFYFNIESIVAMD 



tl57 .aa 



RKNYDYLALISLLIVSFVGILLIYSSDYNISGSLTKNEYIKQTFWVIIGFFLIFIVGKYDLKFVYSMWPLYFLLI 
LALIFTAFFGMTWGARSWIGIWKIiGGQPSEFGKWIILTLSKFYTEKKGYNEFF^ 

FGTAIVYLTIFIFISFFAGIDLHYVIiAFALIGFFSFWAILPVWYEYKVNMGNVFYLIFSNPFYFRVIMGV^ 
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LISVLGFFlSKYGLSIKIIYFYVFFASSILIiVSIVFSKVLSKLMKTYQIKRFLVFLDPAIDAKGAGWNLNQVKIAI 
GSGGLLGKGFLKGPYTHANYVPSQSTDFIFSILAEEFGFLGVSTILILFFFLFFKFLIIMNKSQDRYMALVISGIL 
GLLFFHTSFNVGMSLGVLPITGIPFPFLSYGGSSTITFFLAMSFYFNIESIVAMD 



fl57.nt 



ATGAAGATATTCTTAAAGGTTATAGGCCGTGGTATATTAGGTAGATTAATGGTTTTTAGAAAAAATTATGATTATT 
TGGCTTTGATAAGCTTACTTATAGTTTCTTTTGTTGGTATATTGTTGATTTATTCTAGCGATTATAATATTAGTGG 
ATCTTTAACCAAGAATGAATATATAAAACAAACCTTTTGGGTAATTATTGGATTTTTTCTAATTTTTATAGTGGGC 
AAATATGATTTAAAATTTGTTTATAGCATGGTATATCCTTTATATTTTTTATTAATATTGGCTTTAATTTTTACTG 
CATTTTTTGGAATGACAGTAAATGGAGCAAGATCTTGGATTGGCATATGGAAACTTGGAGGACAGCCTTCTGAATT 
TGGTAAAGTTGTTATTATTTTAACCCTTTCAAAATTTTACACTGAAAAAAAGGGTTATAATGAATTTTTTACCTTT 
ATTACTGCATTTTTATTAATTTTTCCATCGGTAATTCTTATATTATTGCAACCTGATTTTGGTACAGCAATAGTAT 
ATTTAACCATTTTTATATTTATTTCTTTTTTTGCAGGAATAGATTTGCACTATGTTTTAGCATTTGCGTTGATAGG 
GTTTTTTTCTTTTGTTTTTGCAATTTTACCGGTTTGGTATGAATATAAGGTGAATATGGGTAATGTATTTTATCTT 
ATTTTCTCAAATCCTTTTTATTTTAGAGTAATAATGGGAGTGCTGCTTTTAATTCTTTTGATTTCTGTTTTAGGAT 
TTTTCATTTCTAAATATGGTTTGAGTATTAAAATAATTTATTTTTATGTATTTTTTGCAAGTTCTATTTTATTAGT 
TTCAATAGTGTTTTCAAAGGTTCTTTCAAAGTTAATGAAGACTTATCAGATTAAACGGTTTTTGGTATTCTTAGAT 
CCGGCTATTGATGCTAAGGGTGCTGGTTGGAATTTAAATCAGGTTAAAATAGCAATTGGTTCTGGCGGTCTTTTGG 
GCAAAGGATTTTTAAAGGGACCTTATACCCACGCTAATTATGTGCCATCTCAAAGCACAGATTTTATTTTTTCTAT 
TCTTGCCGAAGAGTTTGGGTTTTTGGGTGTTAGCACTATTTTAATATTATTTTTTTTCCTTTTTTTTAAATTTTTG 
ATAATAATGAATAAAAGTCAAGATAGATATATGGCCTTAGTAATATCTGGAATTTTGGGACTTTTATTTTTTCATA 
CTTCTTTTAATGTTGGAATGTCTTTAGGAGTTCTTCCTATTACCGGGATTCCCTTTCCTTTTCTCTCTTATGGAGG 
TTCTTCTACTATTACATTTTTTTTAGCAATGTCTTTTTATTTTAATATTGAATCAATAGTTGCTATGGATTGA 



tl57.nt 



AGAAAAAATTATGATTATTTGGCTTTGATAAGCTTACTTATAGTTTCTTTTGTTGGTATATTGTTGATTTATTCTA 

GCGATTATAATATTAGTGGATCTTTAACCAAGAATGAATATATAAAACAAACCTTTTGGGTAATTATTGGATTTTT 

TCTAATTTTTATAGTGGGCAAATATGATTTAAAATTTGTTTATAGCATGGTATATCCTTTATATTTTTTATTAATA 

TTGGCTTTAATTTTTACTGCATTTTTTGGAATGACAGTAAATGGAGCAAGATCTTGGATTGGCATATGGAAACTTG 

GAGGACAGCCTTCTGAATTTGGTAAAGTTGTTATTATTTTAACCCTTTCAAAATTTTACACTGAAAAAAAGGGTTA 

TAATGAATTTTTTACCTTTATTACTGCATTTTTATTAATTTTTCCATCGGTAATTCTTATATTATTGCAACCTGAT 

TTTGGTACAGCAATAGTATATTTAACCATTTTTATATTTATTTCTTTTTTTGCAGGAATAGATTTGCACTATGTTT 

TAGCATTTGCGTTGATAGGGTTTTTTTCTTTTGTTTTTGCAATTTTACCGGTTTGGTATGAATATAAGGTGAATAT 

GGGTAATGTATTTTATCTTATTTTCTCAAATCCTTTTTATTTTAGAGTAATAATGGGAGTGCTGCTTTT^ 

TTGATTTCTGTTTTAGGATTTTTCATTTCTAAATATGGTTTGAGTATTAAAATAATTTATTTTTATGTATTTTTTG 

CAAGTTCTATTTTATTAGTTTCAATAGTGTTTTCAAAGGTTCTTTCAAAGTTAATGAAGACTTATCAGATTAAACG 

GTTTTTGGTATTCTTAGATCCGGCTATTGATGCTAAGGGTGCTGGTTGGAATTTAAATCAGGTTAAAATAGCAATT 

GGTTCTGGCGGTCTTTTGGGCAAAGGATTTTTAAAGGGACCTTATACCCACGCTAATTATGTGCCATCTCAAAGCA 

CAGATTTTATTTTTTCTATTCTTGCCGAAGAGTTTGGGTTTTTGGGTGTTAGCACTATTTTAATATTATTTTTTTT 

CCTTTTTTTTAAATTTTTGATAATAATGAATAAAAGTCAAGATAGATATATGGCCTTAGTAATATCTGGAATTTTG 

GGACTTTTATTTTTTCATACTTCTTTTAATGTTGGAATGTCTTTAGGAGTTCTTCCTATTACCGGGATTCCCTTTC 

CTTTTCTCTCTTATGGAGGTTCTTCTACTATTACATTTTTTTTAGCAATGTCTTTTTATTTTAATATTGAATCAAT 

AGTTGCTATGGATTGA 



fl7.aa 



MIVFLFFSIYLIILFKRSSNSPLYFVPDTKFETLSIRIVLSCSLLLIFFCTMLDARPSTIAVFPTPGSPISIALFL 
FLLKSIFVRVLISASLPTKGSNFLAFASAVKFLTYFPISKCSFSSRISSSNSL 
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tl7 .aa 



PLYFVPDTKFETLSIRIVLSCSLLLIFFCTMLDARPSTIAVFPTPGSPISIALFLFLLKSIFVRVLISASLPTKGS 
NFLAFASAVKFLTYFPISKCSFSSRISSSNSL 



fl7.nt 



ATGATTGTGTTTTTGTTTTTTTCAATATACTTAATTATATTATTTAAACGATCTTCAAACTCGCCTCTATATTTTG 
TTCCCGATACCAAGTTTGAAACCTTAAGCATTAGAATTGTTTTGTCTTGTAGTTTGCTACTTATTTTTTTTTGCAC 
TATGCTTGATGCAAGGCCTTCAACTATTGCTGTTTTTCCCACACCAGGTTCGCCTATTAGCATTGCACTATTTTTA 
TTTCTTCTCAAGAGTATATTTGTAAGAGTTTTAATCTCTGCTTCTCTTCCAACCAAGGGGTCTAATTTTTTGGCTT 
TTGCAAGTGCTGTTAAATTTTTGACATACTTTCCAATTTCAAAGTGCTCATTTTCAAGTCGTATTTCTTCATCAAA 
TTCTTTGTAG 



tl7 .nt 



CCTCTATATTTTGTTCCCGATACCAAGTTTGAAACCTTAAGCATTAGAATTGTTTTGTCTTGTAGTTTGCTACTTA 
TTTTTTTTTGCACTATGCTTGATGCAAGGCCTTCAACTATTGCTGTTTTTCCCACACCAGGTTCGCCTATTAGCAT 
TGCACTATTTTTATTTCTTCTCAAGAGTATATTTGTAAGAGTTTTAATCTCTGCTTCTCTTCCAACCAAGGGGTCT 
AATTTTTTGGCTTTTGCAAGTGCTGTTAAATTTTTGACATACTTTCCAATTTCAAAGTGCTCATTTTCAAGTCGTA 
TTTCTTCATCAAATTCTTTGTAG 



fl70.aa 



MKAFKVKNLRRFSNFIRILVIVLFIiNSLLSLFVFIiAGSYNIFVYNFQKFYLDLAIILSSVSFGLESTRLIFFYFLK 
NKKIKYYLILIFSFIIFFIALVFKIFLSGNK 



tl70 .aa 



YNIFVYNFQKFYLDLAIILSSVSFGLESTRLIFFYFLKNKKIKYYLILIFSFIIFFIALVFKIFLSGNK 



fl70.nt 



ATGAAAGCTTTTAAAGTAAAAAATCTAAGACGTTTTTCAAATTTTATTAGAATTTTGGTTATTGTATTGTTTTTAA 

ATTCTTTGTTAAGTTTGTTCGTGTTTTTGGCTGGTTCTTACAATATTTTTGTTTACAATTTTCAGAAATTTTATCT 

TGATCTTGCTATTATTTTAAGCTCTGTTTCTTTTGGACTTGAATCTACTAGACTGATATTTTTTTATTTTTTGAAA 

AATAAAAAAATTAAGTATTATTTAATTTTAATTTTTAGTTTTATAATTTTTTTTATTGCTCTTGTTTTTAAAAT 

TTCTTTCTGGTAATAA 

ATAG 



tl70.nt 
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TACAATATTTTTGTTTACAATTTTCAGAAATTTTATCTTGATCTTGCTATTATTTTAAGCTCTGTTTCTTTTGGAC 
TTGAATCTACTAGACTGATATTTTTTTATTTTTTGAAAAATAAAAAAATTAAGTATTATTTAATTTTAATTTTTAG 
TTTTATAATTTTTTTTATTGCTCTTGTTTTTAAAATTTTTCTTTCTGGTAATAAATAG 



fl86.aa 



MKKLIIIFTLFLSQACNLSTMHKIIXPKEDMKILYSEIAELRKKIJSrLNHLEIDDTLEKVAKEYAIKLGENRTITHTL 
FGTTPMQRIHKYDQSFNLTREILASGIELNRWNAWLNSPSHKEALINTDTDKIGGYRLKTTDNIDIFWLFGKRK 
YKN 



tl86 .aa 



TMHKIDTKEDMKILYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITHTLFGTTPMQRIHKYDQSFNLT 
REILASGIELNRWNAWLNSPSHKEALINTDTDKIGGYRLKTTDNIDIFWLFGKRKYKN 



fl86 ,nt 



ATGAAAAAATTGATTATAATTTTTACACTGTTTTTATCTCAAGCATGCAATTTAAGTACAATGCATAAAATAGATA 
CAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGAGAAAAAAATTAAATCTAAACCATCTAGAAAT 
AGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTAAACTGGGAGAAAATAGAACAATAACTCACACCCTT 
TTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCCTTTAATTTAACAAGAGAAATACTGGCATCAG 
GAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAAGCCACAAAGAAGCTCTTATTAATACAGATAC 
CGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGATATATTTGTAGTTCTTTTTGGAAAAAGAAAA 
TATAAGAATTGA 



tl86 .nt 



ACAATGCATAAAATAGATACAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGAGAAAAAAATTAA 
ATCTAAACCATCTAGAAATAGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTAAACTGGGAGAAAATAG 
AACAATAACTCACACCCTTTTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCCTTTAATTTAACA 
AGAGAAATACTGGCATCAGGAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAAGCCACAAAGAAG 
CTCTTATTAATACAGATACCGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGATATATTTGTAGT 
TCTTTTTGGAAAAAGAAAATATAAGAATTGA 



fl96 .aa 



MKLKARMLLLVLILIAFFISILFFAFGMLINSKLVDQQFNLMINLIESIKSSFNLYISSMEEKVRVSSMYF^ 

FNEASKIKSKRLSFISIX^SEILIQTGSNMMVTDKEGKIVFTTAVKDNSDFGKSIGDREYFTKLKE 

lADPGSIEESLLKDISKIKNKKGQIPYILIGMPLRDFETDNIFGYFMFLYSMDYIYRSFRGINFGILSSGRALAYD 

TTGRLLVHHWL PGD I LTDI S AS YSN 1 1 KKTS EDLLQKNKE I STVYYYDPKSNKKYVG I SQKVLLNLSNNKF I LLM 

RTSEDDFYYMSRATTIILAISFVFTLI^LAIATLYLVKKLSSSLNKILEYSERLASGNFTADINFGKV^ 

YEGLEQLRTNFSSVAKGVIENLDYLYENAIQIANASQNLSSGAVEQASTLEQMTANIEQISQGVSENTENAATTEK 

lAVNTNERTKEGHKSVVKAIEAMTVITEKIGIIDEITRQTNLIAI^ASIEAARVGEKG 

ESAREIIDIANRSLWASRAGENFEQIVPGMEQTARLVKNISNESYKQSVQIEQFKNAIEQVSQLVQTTASSSEEL 
SAMSEKMLESVKDLKESVDYFKIEK 
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tl96 .aa 



MLINSKLVDQQFNLMINLIESIKSSFNLYISSMEEKVRVSSMYFNSAEKFNEASKIKSKRLSFISDQSEILIQTGS 

NMMVTDKEGKIVTTTAVKDNSDFGKSIGDREYFTKLKESNSIVYNSFVMLADPGSIEESLLKDISKIK^ 

ILIGMPLRDFETDNIFGYFMFLYSMDYIYRSFRGINFGILSSGRALAYDTTGRLLVHHWLPGDILTDISASYSNI 

IKKTSEDLLQKNKEISTWYYDPKSNKKYVGISQKVLLNLSNNKFILLMRTSEDDFYYMSRATTIIIAISFW 

MLAIATLYLVKKLSSSIJ^KILEYSERIJVSGNFTADINFGKWDTVELYSLYEGLEQLRTNFSSVAKGVIENLDYLY^ 

NAIQIANASQNLSSGAVEQASTLEQMTANIEQISQGVSENTENAATTEKIAVNTNERTKEGHKSWKAIEAiyr^ 

EKIGIIDEITRQTNLLALNASIEAARVGEKGKGFEWAAEVRKLADQSKESAREIIDIANRSLTVASRAGENFEQI 

VPGMEQTARLVKNISNESYKQSVQIEQFKNAIEQVSQLVQTTASSSEELSAMSEKMLESVKDLKESVDYFKIEK 



fl96 .nt 

ATGAAGCTTAAAGCTAGGATGTTGCTACTTGTTCTTATTCTGATAGCATTCTTTATATCAATTTTGTTTTTTGCTT 

TTGGAATGCTTATTAATAGTAAATTGGTGGATCAACAGTTTAATCTTATGATAAATCTTATTGAAAGCATTAAAAG 

TTCTTTTAATCTTTACATCTCTTCAATGGAAGAGAAAGTTAGGGTTAGTTCCATGTATTTCAACTCTGCTGAAAAA 

TTTAATGAGGCTAGTAAAATTAAATCCAAAAGGTTGAGCTTTATTTCAGATCAATCTGAAATTCTTATTCAAACCG 

GTAGTAATATGATGGTTACAGACAAAGAAGGTAAAATAGTGTTTACTACGGCGGTTAAGGATAATAGTGATTTTGG 

CAAATCTATTGGGGATAGAGAATATTTTACAAAACTTAAGGAGTCTAATAGTATTGTTTACAATTCCTTTGTCATG 

TTGGCAGATCCCGGGTCTATTGAGGAGTCTTTACTTAAAGATATTTCCAAGATAAAAAATAAAAAAGGTCAGATTC 

CTTACATATTAATAGGTATGCCATTAAGAGATTTTGAAACAGATAACATTTTTGGTTATTTTATGTTTCTTTATTC 

AATGGATTATATATATAGGTCTTTTAGAGGGATTAATTTTGGAATACTCTCTAGCGGTCGTGCGCTAGCTTATGAT. 

ACTACGGGTAGATTGTTGGTTCATCATGTAGTATTGCCAGGTGATATTTTGACTGATATTAGTGCTTCTTATTCCA 

ATATTATTAAGAAAACATCTGAAGATTTGTTGCAAAAGAATAAAGAAATTTCAACTGTTTATTATTATGATCCTAA 

AAGCAATAAGAAATATGTGGGAATTAGTCAAAAGGTGTTATTAAACTTGTCTAATAATAAATTTATTCTTTTAATG 

AGAACTTCAGAGGACGATTTTTATTACATGTCACGAGCTACAACTATAATCTTAGCAATTAGTTTTGTATTTACAT 

TACTTATGCTTGCTATTGCAACTCTTTATCTTGTGAAAAAGTTAAGCTCTTCTTTGAATAAGATACTGGAATATTC 

TGAGAGACTTGCTTCTGGTAATTTTACTGCTGATATTAATTTTGGCAAATGGGATACTGTAGAGCTTTACAGTTTG 

TACGAAGGGCTTGAGCAGTTGAGAACCAATTTTTCTTCAGTTGCAAAAGGAGTTATTGAAAATCTAGATTATCTTT 

ATGAAAATGCAATTCAAATAGCAAATGCAAGCCAGAATTTAAGTTCTGGCGCTGTTGAGCAGGCTTCTACTTTAGA 

GCAAATGACAGCAAATATTGAGCAAATTTCACAAGGTGTTTCTGAGAATACTGAAAATGCAGCTACTACTGAAAAA 

ATTGCTGTTAATACTAATGAAAGGACTAAAGAGGGGCATAAATCTGTTGTTAAGGCTATTGAGGCAATGACTGTAA 

TTACTGAAAAAATTGGAATTATTGATGAGATAACAAGGCAAACCAATTTGCTTGCTTTAAATGCCTCGATTGAAGC 

TGCACGAGTGGGAGAAAAGGGCAAGGGATTTGAAGTGGTAGCTGCTGAGGTTAGAAAGCTTGCAGATCAAAGCAAA 

GAATCAGCAAGAGAGATTATTGATATTGCAAACAGAAGTTTAACTGTTGCAAGTCGTGCTGGGGAAAATTTTGAAC 

AAATAGTTCCTGGTATGGAACAAACAGCCAGACTTGTAAAAAATATTTCTAATGAAAGTTATAAGCAAAGTGTTC 

AATAGAGCAATTTAAAAATGCAATAGAGCAGGTTAGTCAGTTAGTCCAAACTACAGCCTCAAGCAGTGAAGAGCTT 

TCTGCAATGTCTGAAAAGATGTTAGAGAGTGTAAAAGATTTAAAAGAATCTGTTGATTATTTTAAGATCGAAAAGT 

AA 

tl96.nt 



ATGCTTATTAATAGTAAATTGGTGGATCAACAGTTTAATCTTATGATAAATCTTATTGAAAGCATTAAAAGTO 

TTAATCTTTACATCTCTTCAATGGAAGAGAAAGTTAGGGTTAGTTCCATGTATTTCAACTCTGCTGAAAAATO 

TGAGGCTAGTAAAATTAAATCCAAAAGGTTGAGCTTTATTTCAGATCAATCTGAAATTCTTATTCAAACCGGTAGT 

AATATGATGGTTACAGACAAAGAAGGTAAAATAGTGTTTACTACGGCGGTTAAGGATAATAGTGATTTTGGCAA^ 

CTATTGGGGATAGAGAATATTTTACAAAACTTAAGGAGTCTAATAGTATTGTTTACAATTCCTTTGTCATGTT^ 

AGATCCCGGGTCTATTGAGGAGTCTTTACTTAAAGATATTTCCAAGATAAAAAATAAAAAAGGTCAGATTCCTTAC 

ATATTAATAGGTATGCCATTAAGAGATTTTGAAACAGATAACATTTTTGGTTATTTTATGTTTCTTTATTCAATGG 

ATTATATATATAGGTCTTTTAGAGGGATTAATTTTGGAATACTCTCTAGCGGTCGTGCGCTAGCTTATC 

GGGTAGATTGTTGGTTCATCATGTAGTATTGCCAGGTGATATTTTGACTGATATTAGTC 
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attaagaaaacatctgaagatttgttgcaaaagaataaagaaatttcaactgtttattattatgatcctaaaagca 

ataagaaatatgtgggaattagtcaaaaggtgttattaaacttgtctaataataaatttattcttttaatgag;^ 

ttcagaggacgatttttattacatgtcacgagctacaactataatcttagcaattagttttgtatttacattactt 

atgcttgctattgcaactctttatcttgtgaaaaagttaagctcttctttgaataagatactggaatattctgaga 

gacttgcttctggtaattttactgctgatattaattttggcaaatgggatactgtagagctttacagtttgtacga 

agggcttgagcagttgagaaccaatttttcttcagttgcaaaaggagttattgaaaatctagattatctttatgaa 

aatgcaattcaaatagcaaatgcaagccagaatttaagttctggcgctgttgagcaggcttctactttagagcaaa 

TGACAGCAT^TATTGAGCAAATTTCACAAGGTGTTTCTGAGAATACTGAAAATGCAGCTACTACTGAAAAAATTGC 

tgttaatactaatgaaaggactaaagaggggcataaatctgttgttaaggctattgaggcaatgactgtaattact 
gaaaaaattggaattattgatgagataacaaggcaaaccaatttgcttgctttaaatgcctcgattgaagctgcac 
gagtgggagaaaagggcaagggatttgaagtggtagctgctgaggttagaaagcttgcagatcaaagcaaagaatc 
agcaagagagattattgatattgcaaacagaagtttaactgttgcaagtcgtgctggggaaaattttgaacaaata 
gttcctggtatggaacaaacagccagacttgtaaaaaatatttctaatgaaagttataagcaaagtgttcaaatag 
agcaatttaaaaatgcaatagagcaggttagtcagttagtccaaactacagcctcaagcagtgaagagctttctgc 
aatgtctgaaaagatgttagagagtgtaaaagatttaaaagaatctgttgattattttaagatcgaaaagtaa 



f 899.aa 



MRFIIAFLMILNQGFSNLFSLPPEDIIFESSYEVAIKKAQKLNKNVLILVGRDIKENLIKDFLNSFTNGEIIHKVS 
RKSVFLVIDKDNEIFNKINLQKSPTIFFVDSKNEQIKAAYVGAVLSSVQFDKDFLNYVMGAIKSTSVLKKQKDYEI 
NTADERTFFYKTLKGDWRLKFNGKDRKLVLFDTDLKEFLVFKDINENKLYAIPKSRIGNIYFSLLGNEEWKLFGKI 
K 



t899 .aa 
f 899.nt 



ATGAGATTTATAATTGCATTTTTAATGATTTTAAATCAAGGATTTTCAAATTTGTTTTCTTTGCCTCCGGAAGATA 
TTATTTTTGAGAGTTCTTATGAGGTTGCAATTAAAAAAGCTCAAAAATTGAATAAAAATGTTTTAATTTTGGTTGG 
TAGAGATATTAAAGAAAATTTAATAAAAGATTTTTTAAACTCTTTTACAAATGGTGAAATTATTCACAAAGTATCT 
AGAAAAAGTGTTTTTTTAGTTATTGATAAGGATAATGAAATTTTTAATAAAATTAATCTACAAAAAAGTCCGACTA 
TTTTTTTTGTTGATTCTAAGAATGAGCAAATAAAGGCAGCTTATGTGGGAGCTGTTTTGAGCAGTGTTCAATTTGA 
TAAGGATTTTTTAAACTATGTTATGGGAGCTATAAAATCAACAAGTGTTTTAAAAAAGCAAAAAGATTATGAAATT 
AATACTGCTGATGAGAGAACCTTTTTTTACAAAACATTAAAAGGTGATTGGCGATTAAAGTTTAATGGTAAAGACA 
GAAAGCTTGTTCTTTTTGATACAGATCTTAAAGAATTTTTAGTTTTTAAAGATATTAATGAAAACAAGCTTTATGC 
TATTCCTAAGTCTAGGATTGGTAATATTTATTTTTCATTATTGGGAAATGAAGAATGGAAGCTTTTTGGAAAAATA 
AAATAA 



t899 .nt 



TTGCCTCCGGAAGATATTATTTTTGAGAGTTCTTATGAGGTTGCAATTAAAAAAGCTCAAAAATTGAATAAAAATG 

TTTTAATTTTGGTTGGTAGAGATATTAAAGAAAATTTAATAAAAGATTTTTTAAACTCTTTTACAAATGGTGA 

TATTCACAAAGTATCTAGAAAAAGTGTTTTTTTAGTTATTGATAAGGATAATGAAATTTTTAATAAAATTAATCTA 

CAAAAAAGTCCGACTATTTTTTTTGTTGATTCTAAGAATGAGCAAATAAAGGCAGCTTATGTGGGAGCTGTTTTGA 

GCAGTGTTCAATTTGATAAGGATTTTTTAAACTATGTTATGGGAGCTATAAAATCAACAAGTGTTTTAAAAAAGCA 

AAAAGATTATGAAATTAATACTGCTGATGAGAGAACCTTTTTTTACAAAACATTAAAAGGTGATTG^^ 

TTTAATGGTAAAGACAGAAAGCTTGTTCTTTTTGATACAGATCTTAAAGAATTTTTAGTTTTTAAAGATATTAATG 

AAAACAAGCTTTATGCTATTCCTAAGTCTAGGATTGGTAATATTTATTTTTCATTATTGGGAAATGAAGA^ 

GCTTTTTGGAAAAATAAAATAA 
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f924.aa 



MQDRKFSFRKYFLISVPLIFIVSGITYFYSTQMLEKSQKCVEDNLDAKVKLVDMEDFYFDLNECIJ^JMDDFFIPRPD 
FLNENLNKNLWDGLIKNKFLDENFFKDLWIKKENLFNVDIEKENEKLIDKILEISK 



t924 .aa 



TQMLEKSQKCVEDNLDAKVKLVDMEDFYFDLNECLNMDDFFIPRPDFLNENLNKNLVVDGLIKNKFLDENFFKDLW 
IKKENLFNVDIEKENEKLIDKILEISK 



f924.nt 



ATGCAAGATAGAAAGTTTAGTTTTAGAAAATATTTTTTAATTTCAGTATTTTTGATTTTTATTGTTTCTGGTATTA 
CTTATTTCTATTCAACACAAATGTTGGAAAAATCTCAAAAGTGTGTTGAAGACAATTTAGACGCTAAGGTTAAATT 
AGTTGATATGGAAGATTTTTATTTTGATTTAAATGAATGTCTAAATATGGATGATTTTTTTATTCCAAGACCTGAT 
TTTTTAAATGAAAATTTAAATAAGAATTTAGTTGTTGATGGATTGATTAAAAATAAATTTCTTGATGAGAATTTTT 
TCAAGGATCTTTGGATTAAAAAGGAAAATTTATTTAACGTTGATATTGAGAAGGAGAATGAAAAATTAATAGATAA 
GATTTTAGAAATTTCCAAATGA 



t924 .nt 



ACACAAATGTTGGAAAAATCTCAAAAGTGTGTTGAAGACAATTTAGACGCTAAGGTTAAATTAGTTGATATGGAAG 
ATTTTTATTTTGATTTAAATGAATGTCTAAATATGGATGATTTTTTTATTCCAAGACCTGATTTTTTAAATGAAAA 
TTTAAATAAGAATTTAGTTGTTGATGGATTGATTAAAAATAAATTTCTTGATGAGAATTTTTTCAAGGATCTTTGG 
ATTAAAAAGGAAAATTTATTTAACGTTGATATTGAGAAGGAGAATGAAAAATTAATAGATAAGATTTTAGAAATTT 
CCAAATGA 

f925.aa 

MIRKYLIYISLLFIVFEVYSKPAFISQDDSYELDFSSGEVDISVNTNSKFNLSFKDESWIYIKSIENEAFIKLIGE 
SYDNGAVFTFQTFKKEGKIKLVFTYQNVKDSSEFNKIIILKITKNFEVAIPQGVGGGSSRDNNIETGNNLELGGGS 
ISGATSKEIIVRALNLSYINDYKGAIDLLNKYNFNDDKYILLKAEIHYKNGDYLKSYENYLKLKSKYFQSIVFDLI 
RLAIEIJ^IKEEVLENARYLVEKNVDFSESIYL.EIFEFLVTRGEHEFALNFSSLYFPKYINSSFSDKYSYLLGKLYE 
SESKHKDFLKALHYYKLVIDNYPFSYYYERAKIRYLFLKRFF 

t925 .aa 

KPAFISQDDSYELDFSSGEVDISVNTNSKFNLSFKDESWIYIKSIENEAFIKLIGESYDNGAVFTFQTFKKEGKIK 
LVFTYQNVKDSSEFNKIIILKITKNFEVAIPQGVGGGSSRDNNIETGNNLELGGGSISGATSKEIIVRALNLSYIN 
DYKGAIDLLNKYNFl^DKYILLKAEIHYKNGDYLKSYE^nfLKLKSKYFQSIVFDLIRLlAIELNIKEEVLENARYLV 
EKNVDFSESIYLEIFEFLVTRGEHEFALNFSSLYFPKYINSSFSDKYSYLLGKLYESESKHKDFLKALHYYKLVID 
NYPFSYYYERAKIRYLFLKRFF 

f925.nt 

ATGATTAGAAAATATTTGATTTATATAAGTTTGCTATTTATTGTTTTTGAAGTTTACTCTAAGCCAGCTT 

GTCAAGACGATTCGTATGAGCTTGATTTTAGTAGTGGAGAGGTAGATATTAGTGTAAATACCAATTCAAAAT^ 

TCTTTCTTTTAAAGATGAGTCTTGGATTTATATCAAAAGCATTGAAAATG^ 
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TCTTATGATAACGGTGCTGTTTTTACTTTTCAGACTTTTAAAAAAGAAGGCAAAATTAAATTGGT^ 

AAAATGTTAAAGATTCAAGTGAATTTAATAAAATAATTATCTTGAAAATTACAAAGAATTTTGAAGTTGCAATTCC 

ACAAGGCGTTGGTGGTGGCTCTAGCAGGGACAATAACATTGAAACTGGTAATAATCTTGAACTTGGGGGGGGGAGT 

ATTAGCGGGGCAACTTCTAAAGAGATTATTGTTAGGGCTTTAAATTTGTCCTACATAAATGATTACAAAGGAGCAA 

TAGATTTGCTTAATAAGTATAATTTCAATGACGATAAATATATTTTATTGAAGGCGGAAATTCATTATAAAAATGG 

TGATTATTTAAAATCTTATGAAAATTATTTGAAATTGAAGAGTAAATATTTTCAAAGCATTGTTTTTGATCTAATT 

AGGCTTGCTATAGAATTAAATATTAAAGAAGAGGTTTTAGAGAACGCTAGATATTTAGTTGAAAAGAATGTTGATT 

TTTCTGAGAGCATTTATCTTGAGATCTTTGAATTCTTAGTAACAAGGGGAGAGCATGAGTTTGCTTTAAATTTTAG 

CTCTCTTTACTTTCCTAAGTATATTAATTCAAGCTTTTCAGACAAATATAGTTATTTGTTGGGAAAACTTTATGAG 

TCTGAGAGCAAGCATAAAGATTTTTTAAAGGCTTTGCATTACTATAAATTGGTTATTGATAATTACCCTTTTAGTT 

ATTATTATGAGAGAGCCAAGATAAGATATTTATTTTTAAAGCGGTTTTTTTAG 

t925 .nt 

AAGCCAGCTTTTATAAGTCAAGACGATTCGTATGAGCTTGATTTTAGTAGTGGAGAGGTAGATATTAGTGTAAATA 
CCAATTCAAAATTTAATCTTTCTTTTAAAGATGAGTCTTGGATTTATATCAAAAGCATTGAAAATGAAGCTTTTAT 
TAAGTTAATTGGAGAATCTTATGATAACGGTGCTGTTTTTACTTTTCAGACTTTTAAAAAAGAAGGCAAAATTAAA 
TTGGTTTTCACTTATCAAAATGTTAAAGATTCAAGTGAATTTAATAAAATAATTATCTTGAAAATTACAAAGAATT 
TTGAAGTTGCAATTCCACAAGGCGTTGGTGGTGGCTCTAGCAGGGACAATAACATTGAAACTGGTAATAATCTTGA 
ACTTGGGGGGGGGAGTATTAGCGGGGCAACTTCTAAAGAGATTATTGTTAGGGCTTTAAATTTGTCCTACATAAAT 
GATTACAAAGGAGCAATAGATTTGCTTAATAAGTATAATTTCAATGACGATAAATATATTTTATTGAAGGCGGAAA 
TTCATTATAAAAATGGTGATTATTTAAAATCTTATGAAAATTATTTGAAATTGAAGAGTAAATATTTTCAAAGCAT 
TGTTTTTGATCTAATTAGGCTTGCTATAGAATTAAATATTAAAGAAGAGGTTTTAGAGAACGCTAGATATTTAGTT 
GAAAAGAATGTTGATTTTTCTGAGAGCATTTATCTTGAGATCTTTGAATTCTTAGTAACAAGGGGAGAGCATGAGT 
TTGCTTTAAATTTTAGCTCTCTTTACTTTCCTAAGTATATTAATTCAAGCTTTTCAGACAAATATAGTTATTTGTT 
GGGTV/U^CTTTATGAGTCTGAGAGCAAGCATAAAGATTTTTTAAAGGCTTTGCATTACTATAAATTGGTTATTGAT 
AATTACCCTTTTAGTTATTATTATGAGAGAGCCAAGATAAGATATTTATTTTTAAAGCGGTTTTTTTAG 



f929.aa 



MTKVLWSAIALLSKDKELIPFYKFLFLFFFFTLLACSKVSKDFIVFNKDVKTSSRIDNPNSNVLEVNKMEDFFGD 

IIDLKGYKILSVQQENUSrLDVYFEQWLAQNFSNLNAYLFIIGFDPKIKAGTILFKTQIDIDPKNSyN^^trLED^ 

DYDFNIVIQGFLKDKSVLWFQKSVLNDVSSYRPIFFDKVNGTVLINKYARSSAYEENRSRESYPISLEKYEKVGE 

DLIISKIEKYEYSNVQGRYCLSSVSEKVGKIDNNIYKTLKNLSKDEVYKFLHGVWYDVHDYNKMHVKDIDEVLFLS 

FERQSSEINLFRKNSQEVAKIEYISKPAYNTLWSAKSLFSDLIVYNFWIKIVDKENIEIKIDTSTNSYDNSGFSG 

TFKRFDEmriJSrVKKGSSDIYFIPSGNYWKDKIYDFSYPHLTYIDENKIYYGIFNIFPLKNNF 

VESFFLEHSERIVQKQKFSTIILNPIKILKDDVSLVKGQKLKLERIEKI 



t929 .aa 



KDKELIPFYKFLFLFFFFTLLACSKVSKDFIVFNKDVKTSSRIDNPNSNVLEVNKMEDFFGDIIDLKGYKILSVQQ 
ENIJSTLDWFEQWLAQNFSNLNAYLFIIGFDPKIKAGTILFKTQIDIDPKNSYNMYLEDITGDYDFNIVIQGFLKD 
KSVLYVFQKSVLNDVSSYRPIFFDKVNGTVLINKYARSSAYEENRSRESYPISLEKYEKVGEDLIISKIEKYEYSN 
VQGRYCLSSVSEKVGKIDNNIYKTLKNLSKX>EWKFLHGVWYDVHDYNKMHVKDIDEV^ 

SQEVAKIEYISKPAYNTLWSAKSLFSDLIVYNFWIKIVDKENIEIKIDTSTNSYDNSGFSGTFKRFDENVO^^ 

GSSDIYFIPSGISTYVYKDKIYDFSYPHLTYIDENKIYYGIFNIFPLKNNFVLEYEIDMGSYKLVESFFLEHSERIVQ 

KQKFSTIILNPIKILKDDVSLVKGQKLKLERIEKI 



f929.nt 
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ATGACAAAGGTTTTGGTTGTTAGTGCGATTGCTCTTCTGAGTAAGGATAAAGAATTAATCCCATTTTATAAATTTT 

TGTTTTTATTCTTTTTTTTTACATTACTTGCTTGTTCCAAGGTAAGCAAAGATTTTATTGTTTTT^ 

AAAGACTTCTTCCAGGATCGATAATCCAAATTCCAATGTTTTAGAAGTTAATAAAATGGAAGATTTTTTTGGAGAT 

ATTATAGATTTAAAAGGTTATi\AAATTCTTTCAGTTCAGCAGGAAAATTTAAATTTAGATGTGTATTTTGAGCAGG 

TGGTTTTAGCTCAAAATTTTTCAAATCTTAATGCATATTTGTTTATTATTGGTTTTGATCCTAAAATTAAAGCTGG 

AACGATTCTTTTTAAAACTCAAATAGATATTGATCCAAAAAATTCTTATAACATGTATCTTGAAGATATTACAGGT 

GATTATGATTTTAATATAGTTATTCAAGGATTTTTAAAAGATAAATCTGTTTTGTATGTTTTTCAAAAATCTGTTT 

TAAATGATGTGTCTTCTTATAGGCCTATATTTTTTGACAAAGTTAATGGAACTGTTCTTATTAATAAGTATGCAAG 

ATCTTCAGCTTATGAAGAAAACAGATCAAGAGAAAGCTATCCTATTTCTTTAGAAAAATATGAAAAAGTGGGGGAA 

GATTTAATAATTAGCAAGATTGAAAAATATGAATATTCTAATGTTCAGGGTAGATATTGTCTTTCTTCTGTGAGCG 

AAAAAGTTGGTAAAATTGATAATAATATTTATAAAACTTTAAAGAATTTAAGCAAAGATGAAGTTTATAAATTTTT 

GCATGGAGTTTGGTATGATGTTCATGACTATAATAAAATGCATGTCAAAGATATTGATGAAGTTTTATTCTTGTCT 

TTTGAAAGGCAATCAAGCGAGATTAATCTTTTCAGGAAAAATTCTCAAGAAGTTGCAAAGATTGAATATATTTCAA 

AACCTGCTTACAATACTCTTAATGTTAGTGCAAAGTCTCTTTTTTCAGATTTGATAGTTTATAACTTTTGGATCAA 

AATTGTAGATAAAGAAAACATTGAAATCAAAATTGACACTAGCACAAATTCTTATGATAATAGTGGATTTTCGGGT 

ACATTTAAGAGGTTTGATGAGAATGTCTTAAATGTTAAAAAAGGGAGTAGTGATATTTATTTTATTCCTAGTGGAA 

ATTACGTGTATAAGGATAAAATTTATGATTTTTCTTACCCCCATTTAACTTATATTGATGAGAATAAAATTTATTA 

TGGCATTTTTAATATTTTTCCTTTAAAAAATAATTTTGTTCTTGAATATGAGATTGACATGGGTAGTTACAAGCTT 

GTTGAATCTTTTTTCCTTGAGCATAGCGAAAGAATTGTTCAAAAGCAAAAATTTTCTACAATCATTTTAAATCCTA 

TTAAAATTTTAAAAGATGATGTAAGCTTAGTTAAAGGGCAAAAATTAAAGCTTGAGCGAATAGAAAAAATATGA 

t929 .nt 



AAGGATAAAGAATTAATCCCATTTTATAAATTTTTGTTTTTATTCTTTTTTTTTACATTACTTGCTTGTTCCAAGG 

TAAGCAAAGATTTTATTGTTTTTAACAAAGATGTAAAGACTTCTTCCAGGATCGATAATCCAAATTCCAATGTTTT 

AGAAGTTAATAAAATGGAAGATTTTTTTGGAGATATTATAGATTTAAAAGGTTATAAAATTCTTTCAGTTCAGCAG 

GAAAATTTAAATTTAGATGTGTATTTTGAGCAGGTGGTTTTAGCTCAAAATTTTTCAAATCTTAATGCATATTTGT 

TTATTATTGGTTTTGATCCTAAAATTAAAGCTGGAACGATTCTTTTTAAAACTCAAATAGATATTGATCCAAAAAA 

TTCTTATAACATGTATCTTGAAGATATTACAGGTGATTATGATTTTAATATAGTTATTCAAGGATTTTTAAAAGAT 

AAATCTGTTTTGTATGTTTTTCAAAAATCTGTTTTAAATGATGTGTCTTCTTATAGGCCTATATTTTTTGACAAAG 

TTAATGGAACTGTTCTTATTAATAAGTATGCAAGATCTTCAGCTTATGAAGAAAACAGATCAAGAGAAAGCTATCC 

TATTTCTTTAGAAAAATATGAAAAAGTGGGGGAAGATTTAATAATTAGCAAGATTGAAAAATATGAATATTCTAAT 

GTTCAGGGTAGATATTGTCTTTCTTCTGTGAGCGAAAAAGTTGGTAAAATTGATAATAATATTTATAAAACTTTAA 

AGAATTTAAGCAAAGATGAAGTTTATAAATTTTTGCATGGAGTTTGGTATGATGTTCATGACTATAATAAAATGCA 

TGTCAAAGATATTGATGAAGTTTTATTCTTGTCTTOyrcAAAGGCAATCAAGCGAGATTAATCTTTTCAGGAAAA^ 

TCTCAAGAAGTTGCAAAGATTGAATATATTTCAAAACCTGCTTACAATACTCTTAATGTTAGTGCAAAGTCTCTTT 

TTTCAGATTTGATAGTTTATAACTTTTGGATCAAAATTGTAGATAAAGAAAACATTGAAATCAAAATTGACACTAG 

CACAAATTCTTATGATAATAGTGGATTTTCGGGTACATTTAAGAGGTTTGATGAGAATGTCTTAAATGTTAAAAAA 

GGGAGTAGTGATATTTATTTTATTCCTAGTGGAAATTACGTGTATAAGGATAAAATTTATGATTTTTCTTACCCCC 

ATTTAACTTATATTGATGAGAATAAAATTTATTATGGCATTTTTAATATTTTTCCTTTAAAAAATAATTTTGTTCT 

TGAATATGAGATTGACATGGGTAGTTACAAGCTTGTTGAATCTTTTTTCCTTGAGCATAGCGAAAGAATTGTTCAA 

AAGCAAAAATTTTCTACAATCATTTTAAATCCTATTAAAATTTTAAAAGATGATGTAAGCTTAGTTAAAGGGCAAA 

AATTAAAGCTTGAGCGAATAGAAAAAATATGA 



f933.aa 



MNKLLIFVLATFCVFSSFAQANDSKNGAFGMSAGEKLLVYETSKQDPIVPFLLNLFLGFGIGSFAQGDILGGSLIL 

GFDAVGIGLIIAGAYLDIKALIXSITKKAAFQWTWGKGVMLAGVWMAVTRLTEIILPFTFANSYNR^ 

GGFEPSFDVAMGQSSALGFELSFKKSY 



t933.aa 
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NDSKNGAFGMSAGEKLLVYETSKQDPIVPFLLNLFLGFGIGSFAQGDILGGSLILGFDAVGIGLILAGAYLDIKAL 

DGITKKJ^FQWTWGKGVMLJVGVVTMAVTRLTEIILPFTFANSYNRKLKNSLWALGGFEPSFDVAM 

SFKKSY 



f933.nt 



ATGAATAAACTTTTAATTTTTGTTTTGGCAACCTTTTGTGTTTTTTCTAGCTTTGCTCAAGCTAATGATTCTAAAA 
ATGGTGCGTTTGGGATGAGTGCTGGAGAAAAACTTTTGGTTTATGAAACTAGCAAGCAAGATCCTATTGTACCATT 
TTTATTGAACCTTTTTTTAGGGTTTGGAATAGGCTCCTTTGCTCAAGGAGATATTCTTGGAGGTTCTCTTATTCTT 
GGATTTGATGCGGTTGGTATAGGGCTTATACTTGCGGGGGCTTATTTGGATATCAAAGCGCTTGATGGTATTACTA 
AAAAAGCTGCTTTTCAATGGACTTGGGGTAAGGGAGTTATGTTAGCAGGTGTGGTTACTATGGCTGTGACAAGATT 
AACAGAAATTATTCTTCCATTTACATTTGCTAATAGTTATAATAGGAAGCTAAAAAATAGCCTTAATGTAGCTTTA 
GGAGGATTTGAACCTAGTTTTGATGTTGCAATGGGCCAATCCAGTGCTCTTGGGTTTGAACTGTCTTTCAAAAAAA 
GCTATTAA 



t933 .nt 



AATGATTCTAAAAATGGTGCGTTTGGGATGAGTGCTGGAGAAAAACTTTTGGTTTATGAAACTAGCAAGCAAGATC 
CTATTGTACCATTTTTATTGAACCTTTTTTTAGGGTTTGGAATAGGCTCCTTTGCTCAAGGAGATATTCTTGGAGG 
TTCTCTTATTCTTGGATTTGATGCGGTTGGTATAGGGCTTATACTTGCGGGGGCTTATTTGGATATCAAAGCGCTT 
GATGGTATTACTAAAAAAGCTGCTTTTCAATGGACTTGGGGTAAGGGAGTTATGTTAGCAGGTGTGGTTACTATGG 
CTGTGACAAGATTAACAGAAATTATTCTTCCATTTACATTTGCTAATAGTTATAATAGGAAGCTAAAAAATAGCCT 
TAATGTAGCTTTAGGAGGATTTGAACCTAGTTTTGATGTTGCAATGGGCCAATCCAGTGCTCTTGGGTTTGAACTG 
TCTTTCAAAAAAAGCTATTAA 



£940 .aa 



MRKYIFIILIAVX.LIGVNIKKIAAAANIDRHTNSTLGIDLSVGIPIFYNDLSKAYPTNLYPGGIGAIKYQYHILNN 

LAIGLELRYMFNFDINHSFNILNPDSSVGKIFYSVPITFSINYIFDIGELFQIPVFTNIGFSUSTTYGDRNNN 

RTFDALPTISFGSGILWNFNYKWAFGATASWWMMFEFGNSAKMAHFALVSLSVTVNVNKL 



t940.aa 



ANIDRHTNSTLGIDLSVGIPIFYNDLSKAYPTNLYPGGIGAIKYQYHILNNLAIGLELRYMFNFDINHSFNILNP 

SSVGKIFYSVPITFSINYIFDIGELFQIPVFTNIGFSlJSrTYGDRNl^ITNLRTFDALPTISFGSGILWNFNYKWAF 

GATASWWMMFEFGNSAKMAHFALVSLSVTVNVNKL 



f940.nt 



ATGAGAAAGTATATTTTTATAATACTAATTGCAGTCTTGCTAATTGGTGTAAACATAAAAAAAATTGCGGCCGCAG 

CCAATATTGATAGGCATACAAACTCCACTTTAGGAATAGATTTAAGTGTAGGAATCCCTATTTTTTACAACGACTT 

ATCAAAAGCTTATCCTACCAATTTATATCCAGGAGGTATTGGGGCAATAAAATACCAGTACCATATTTTAAACAAT 

TTAGCAATTGGACTTGAACTAAGGTATATGTTTAACTTTGATATTAACCATTCTTTTAATATATO 

CAAGTGTAGGTAAAATTTTTTATAGCGTGCCTATTACATTTTCAATAAATTATATATTTGATATA 
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TCAAATTCCAGTCTTCACAAATATAGGGTTTTCTCTTAATACATATGGAGATAGAAACAACAATATTACAAATTTA 
AGAACTTTTGATGCACTCCCTACAATCTCTTTTGGATCTGGAATTTTATGGAACTTTAACTATAAATGGGCTTTTG 
GAGCAACAGCATCTTGGTGGATGATGTTTGAATTTGGAAATTCTGCTAAAATGGCACATTTTGCACTTGTATCATT 
ATCAGTTACAGTGAATGTAAATAAATTGTAG 



t940 .nt 



GCCAATATTGATAGGCATACAAACTCCACTTTAGGAATAGATTTAAGTGTAGGAATCCCTATTTTTTACAACGACT 
TATCAAAAGCTTATCCTACCAATTTATATCCAGGAGGTATTGGGGCAATAAAATACCAGTACCATATTTTAAACAA 
TTTAGCAATTGGACTTGAACTAAGGTATATGTTTAACTTTGATATTAACCATTCTTTTAATATATTAAATCCAGAT 
TCAAGTGTAGGTAAAATTTTTTATAGCGTGCCTATTACATTTTCAATAAATTATATATTTGATATAGGAGAATTAT 
TTCAAATTCCAGTCTTCACAAATATAGGGTTTTCTCTTAATACATATGGAGATAGAAACAACAATATTACAAATTT 
AAGAACTTTTGATGCACTCCCTACAATCTCTTTTGGATCTGGAATTTTATGGAACTTTAACTATAAATGGGCTTTT 
GGAGCAACAGCATCTTGGTGGATGATGTTTGAATTTGGAAATTCTGCTAAAATGGCACATTTTGCACTTGTATCAT 
TATCAGTTACAGTGAATGTAAATAAATTGTAG 



f943 .aa 



MKNQFLNSYFQLITTIFLISSITIAAEEITSTLKVPNGFKVEIFLNNTIEKPRGITSDQDGNIFIGSGSTFAYFVT 

KNRKIYTIAKTLQKPIGIDYWDNKLYISSVDKIYWKNVKEEINKSIKSHKDYTWKMQIFALLPKNN 

KVDSKNNKLIVNIGSQHNVKIPPKKEAVILSINLKTKKEEIVAFGVRNSVGFDFHPISNEIYFSDNGQDGLGDNIP 

PDEINVITEYKEHFGFPYVFGKNQKNYGFYNKAPKNTKFIPSIYELPAHVAPLGIHFYRGNNFPKEYINKLFIAEH 

GSWNRSSPVGYKITTLDIDSKTRTARNYKTFLYGFLKHDKSKFGRPVDIITYYDGSILFTDDFGNKIYRVYYEKI 



t943 .aa 

EITSTLKVPNGFKVEIFLNNTIEKPRGITSDQDGNIFIGSGSTFAYFVTKNRKIYTIAKTLQKPIGIDYWDNKLYI 
SSVDKIYVVKl^n/KEEINKSIKSHKDYTWKMQIFALLPKNNSQ!fflSGRYIK\^ 

VILSINLKTKKEEIVAFGVRNSVGFDFHPISNEIYFSDNGQDGLGDNIPPDEINVITEYKEHFGFPYVFGKNQKNY 
GFYNKAPKNTKFIPSIYELPAHVAPLGIHFYRGNNFPKEYINKLFIAEHGSWNRSSPVGYKITTLDIDSKTRTARN 
YKTFLYGFLKHDKSKFGRPVDIITYYDGSILFTDDFGNKIYRVYYEKI 



f943.nt 



ATGAAAAATCAATTTTTAAATAGCTATTTTCAATTAATTACAACTATTTTCTTAATCTCATCTATAACTATTGCAG 
CAGAAGAAATAACAAGCACACTAAAAGTTCCTAATGGATTTAAAGTCGAAATTTTTTTAAACAATACAATTGAAAA 
ACCTAGAGGAATCACAAGCGATCAAGATGGAAATATATTCATAGGATCTGGAAGCACTTTTGCATACTTTGTAACA 
AAAAACAGAAAAATTTATACCATAGCAAAAACCCTGCAAAAACCTATTGGTATTGATTATTGGGATAATAAACTCT 
ACATATCTTCTGTCGATAAAATATATGTAGTTAAAAATGTAAAAGAAGAAATTAATAAAAGCATAAAATCACATAA 
AGACTATACATGGAAAATGCAAATTTTTGCACTTTTGCCAAAAAATAATTCTCAAATGCACTCAGGACGTTACATT 
AAAGTAGATTCTAAAAATAACAAATTAATAGTAAATATAGGATCCCAGCACAATGTTAAAATTCCCCCAAAAAAAG 
AAGCAGTAATCCTTAGTATTAATTTAAAAACAAAAAAAGAAGAAATAGTAGCTTTTGGAGTGAGAAACTCAGTTGG 
GTTTGATTTTCACCCAATTAGCAATGAAATATATTTTAGCGAC/^TGGCCAAGACGGATTAGGAGACAACATTCC^ 
CCAGATGAAATAAACGTAATAACCGAATATAAAGAACATTTTGGATTTCCCTATGTGTTTGGAAAAAATCAAAAA^ 
ATTACGGTTTTTATAACAAAGCACCCAAAAACACTAAGTTTATCCCATCTATTTACGAACTTCCCGCACATGTAGC 
TCCACTTGGAATACACTTTTACCGGGGAAATAACTTTCCAAAAGAATACATAAATAAATTATTCATAGCAGAACAC 
GGCTCGTGGAACAGATCTTCTCCTGTTGGCTACAAAATAACAACACTAGACATTGATTCTAAAACCAGAACAGCA^ 
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GAAATTACAAGACTTTTTTATATGGATTTTTAAAGCACGACAAATCTAAATTTGGACGCCCTGTTGATATAATCAC 
ATATTATGACGGTTCAATTCTTTTTACAGATGACTTTGGAAATAAAATATACAGAGTTTACTACGAAAAGATTTAA 



t943.nt 



GAAATAACAAGCACACTAAT^GTTCCTAATGGATTTAAAGTCGAAATTTTTTTAAACAATACAATTGAAAAACCTA 
GAGGAATCACAAGCGATCAAGATGGAAATATATTCATAGGATCTGGAAGCACTTTTGCATACTTTGTAACAAAA^ 
CAGAAAAATTTATACCATAGCAAAAACCCTGCAAAAACCTATTGGTATTGATTATTGGGATAATAAACTCTACATA 
TCTTCTGTCGATAAAATATATGTAGTTAAAAATGTAAAAGAAGAAATTAATAAAAGCATAAAATCACATAAAGACT 
ATACATGGAAAATGCAAATTTTTGCACTTTTGCCAAAAAATAATTCTCAAATGCACTCAGGACGTTACATTAAAGT 
AGATTCTAAAAATAACAAATTAATAGTAAATATAGGATCCCAGCACAATGTTAAAATTCCCCCAAAAAAAGAAGCA 
GTAATCCTTAGTATTAATTTAAAAACAAAAAAAGAAGAAATAGTAGCTTTTGGAGTGAGAAACTCAGTTGGGTTTG 
ATTTTCACCCAATTAGCAATGAAATATATTTTAGCGACAATGGCCAAGACGGATTAGGAGACAACATTCCCCCAGA 
TGAAATAAACGTAATAACCGAATATAAAGAACATTTTGGATTTCCCTATGTGTTTGGAAAAAATCAAAAAAATTAC 
GGTTTTTATAACAAAGCACCCAAAAACACTAAGTTTATCCCATCTATTTACGAACTTCCCGCACATGTAGCTCCAC 
TTGGAATACACTTTTACCGGGGAAATAACTTTCCAAAAGAATACATAAATAAATTATTCATAGCAGAACACGGCTC 
GTGGAACAGATCTTCTCCTGTTGGCTACAAAATAACAACACTAGACATTGATTCTAAAACCAGAACAGCAAGAAAT 
TACAAGACTTTTTTATATGGATTTTTAAAGCACGACAAATCTAAATTTGGACGCCCTGTTGATATAATCACATATT 
ATGACGGTTCAATTCTTTTTACAGATGACTTTGGAAATAAAATATACAGAGTTTACTACGAAAAGATTTAA 



f952.aa 



MNYARFAVLIVLLFFYIWFFIILRMKRTNLFLLEKIQNGAKILDIRSPKEYSKSHYLKSINIPFNNLFAKKDKLGD 
FESPIIVYGKSFNKSYEAKKVLKSMGFKNVFVAGTLKDMPQAKKEVG 



t952 .aa 



RMKRTNLFLLEKIQNGAKILDIRSPKEYSKSHYLKSINIPFNNLFAKKDKLGDFESPIIVYGKSFNKSYEAKKVLK 
SMGFKNVFVAGTLKDMPQAKKEVG 



f952.nt 



ATGAATTATGCAAGATTTGCAGTATTAATAGTTCTGCTTTTTTTTTATATTTGGTTTTTTATTATCCTTAGGATC 

AAAGAACTAATCTGTTTTTGTTAGAAAAAATCCAAAATGGAGCAAAAATTTTGGATATTCGGTCTCCCAAAGAAT^ 

TAGCAAGTCTCATTATTTGAAGTCAATTAACATTCCTTTTAATAATTTATTTGCTAAAAAGGATAAATTAGGTGAT 

TTTGAGTCCCCAATAATTGTTTATGGTAAAAGTTTTAATAAGTCTTACGAGGCTAAAAAAGTTTTAAA 

GATTTAAGAATGTGTTTGTTGCTGGAACCTTGAAAGACATGCCACAAGCAAAAAAAGAAGTTGGTTGA 



t952.nt 



AGGATGAAAAGAACTAATCTGTTTTTGTTAGAAAAAATCCAAAATGGAGCAAAAATTTTGGAT^ 
AAGAATATAGCAAGTCTCATTATTTGAAGTCAATTAACATTCCTTTTAATAATTTATTTGCTAAAAAGGATAAATT 
AGGTGATTTTGAGTCCCCAATAATTGTTTATGGTAAAAGTTTTAATAAGTCTTACGAGGCTAAAAAAGTTTTAAAA 
AGCATGGGATTTAAGAATGTGTTTGTTGCTGGAACCTTGAAAGACATGCCACAAGCAAAAAAAGAAGTT^ 
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TABLE 1. Nucleotide and Amino Acid Sequences 
f378.aa 



MIKKFLLFAMLNIFLTNKAHSNEEIIEISTEIQKEKYIPFLISRGKTQLEDLVKYTLEINPELDKNYVNTVAKTYI 
DESLIEGVNYDIAYAQMLLETGALKFNGIVSKEQHNFSGIGATNNLTKGNSFSNITEGIKAHIQHLKAYASKQNIK 
SNMVDPRFYLVKRGSAPTIYDLTGKWAKDKLYDKKLKKILLELLEYNNANKS 



t378 .aa 



NEEIIEISTEIQKEKYIPFLISRGKTQLEDLVKYTLEINPELDKNYVNTVAKTYIDESLIEGVNYDIAYAQMLLET 
GALKFNGIVSKEQHNFSGIGATNNLTKGNSFSNITEGIKAHIQHLKAYASKQNIKSNMVDPRFYLVKRGSAPTIYD 
LTGKWAKDKLYDKKLKKI LLELLEYNNANKS 



f378.nt 



ATGATAAAAAAATTCTTGCTATTTGCAATGCTCAACATCTTTTTAACAAATAAAGCTCATAGTAATGAAGAGATAA 
TCGAAATAAGTACTGAAATACAAAAGGAAAAATATATTCCCTTTTTAATAAGTAGAGGAAAAACTCAACTAGAAGA 
CCTTGTAAAATATACTCTAGAAATAAATCCAGAGCTTGACAAAAACTATGTAAATACTGTTGCTAAAACCTATATA 
GACGAATCTTTGATTGAAGGGGTTAATTATGACATTGCCTATGCTCAAATGTTACTAGAAACAGGAGCTCTAAAAT 
TCAATGGAATAGTTTCAAAAGAACAACACAATTTTTCAGGAATAGGCGCTACTAATAATCTTACAAAAGGAAATTC 
TTTTTCCAATATTACAGAAGGAATTAAAGCTCATATTCAACATTTAAAAGCTTATGCTTCAAAACAAAATATCAAA 
TCAAATATGGTTGATCCTAGATTTTACCTTGTTAAAAGAGGATCTGCTCCAACAATATATGATTTGACTGGGAAAT 
GGGCAAAAGACAAACTTTACGACAAAAAACTTAAAAAAATATTATTAGAACTATTAGAATATAATAATGCAAATAA 
AAGCTAA 



t378 .nt 

AATGAAGAGATAATCGAAATAAGTACTGAAATACAAAAGGAAAAATATATTCCCTTTTTAATAAGTAGAGGAAAAA 
CTCAACTAGAAGACCTTGTAAAATATACTCTAGAAATAAATCCAGAGCTTGACAAAAACTATGTAAATACTGTTGC 
TAAAACCTATATAGACGAATCTTTGATTGAAGGGGTTAATTATGACATTGCCTATGCTCAAATGTTACTAGAAACA 
GGAGCTCTAAAATTCAATGGAATAGTTTCAAAAGAACAACACAATTTTTCAGGAATAGGCGCTACTAATAATCTTA 
CAAAAGGAAATTCTTTTTCCAATATTACAGAAGGAATTAAAGCTCATATTCAACATTTAAAAGCTTATGCTTCAAA 
ACAAAATATCAAATCAAATATGGTTGATCCTAGATTTTACCTTGTTAAAAGAGGATCTGCTCCAACAATATATGAT 
TTGACTGGGAAATGGGCAAAAGACAAACTTTACGACAAAAAACTTAAAAAAATATTATTAGAACTATTAGAATATA 
ATAATGCAAATAAAAGCTAA 



f4.aa 



mklfrrnvmikmpssftiifslivfvtiltyvipagkfdkefkqmgdgskreiivagtyqyvdrgsrgflhpimti 

ltamskgmehavevivfvlivggaygiimktgaidvgiyflikklghkdkllipllmfifsiggtvtgmseetlpf 

yfvmiplivalgydslvgaaiialgagvgtmastvnpfatgiasaiasislqix3fyfrivlyfvsvlaait 

askikkdpskslvysqkdehyqyfvkkdglstgdnaqnaleftfahklvlllfgfmililifsivnlgwwm 

lylgvaiisaficklgetemwdafvkgseslltaalvigijmgvmivcdixslitdtmij^aatn^ 

neiiqifigfwpsssghasltmpimapladflsipraswiamqtasglinlitptsgvii^vlgisrlsyg™ 

kfvlplfmieffisilviianiylsf 



t4 .aa 
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KFDKEFKQMGDGSKREIIVAGTYQYVDRGSRGFLHPIMTILTAMSKGMEHAVEVIVFVLIVGGAYGIIMKTGAIDV 

GIYFLIKKLGHKDKLLIPLLMFIFSIGGTVTGMSEETLPFYFVMIPLIVALGYDSLVGAAIIALGAGVGTMASTVN 

PFATGIASAIASISLQDGFYFRIVLYFVSVLAAITYVCVYASKIKKDPSKSLVYSQKDEHYQYFVKKDGLSTGDNA 

QNALEFTFAHKLVLLLFGFMI L I L I F S I VNLGWWMQEMTMLYLGVAI I S AF I CKLGETEMWDAFVKGSESLLTAAL 

VIGIARGVMIVCDDGLITDTMl^AATNFLYNLPRPLFIILNEIIQIFIGFWPSSSGHASLTMPIMAPI^ 

RASWIAMQTASGLINLITPTSGVIMAVLGISRLSYGTWFKFVLPL.FMIEFFISILVIIANIYLSF 



f4 .nt 



ATGAAATTATTTAGGAGAAACGTTATGATCAAAATGCCAAGTAGTTTTACAATAATATTTTCTTTAATTGTATTTG 
TTACCATTTTAACGTATGTGATTCCTGCCGGTAAGTTTGATAAAGAATTTAAGCAAATGGGTGATGGATCTAAAAG 
GGAAATAATTGTTGCTGGAACTTATCAATATGTAGATCGAGGCTCTAGGGGATTTTTACATCCTATTATGACTATT 
TTAACCGCAATGTCAAAGGGGATGGAACATGCAGTTGAAGTTATTGTTTTTGTTTTAATTGTTGGGGGTGCTTATG 
GGATTATTATGAAAACTGGAGCAATAGATGTGGGAATTTATTTTTTAATCAAGAAGTTGGGGCACAAAGATAAGTT 
GCTTATTCCTTTGTTAATGTTTATTTTTTCAATTGGTGGAACTGTAACCGGAATGAGTGAAGAGACCCTTCCTTTT 
TATTTTGTTATGATTCCCTTGATAGTAGCTTTGGGTTATGATAGTCTTGTTGGAGCGGCTATTATTGCTTTAGGAG 
CTGGAGTGGGAACTATGGCTTCTACTGTAAATCCATTTGCGACAGGAATTGCATCTGCAATAGCTTCTATTAGCTT 
GCAGGATGGATTTTATTTTAGAATTGTTCTTTATTTTGTATCAGTATTGGCTGCTATAACCTATGTTTGTGTTTAT 
GCGTCTAAAATTAAAAAGGATCCCTCAAAATCGCTTGTGTATTCTCAAAAAGATGAACATTATCAATATTTTGTTA 
AAAAAGATGGACTTTCTACCGGAGATAATGCTCAGAATGCTCTTGAGTTTACTTTTGCTCATAAATTAGTTTTACT 
TTTATTTGGATTTATGATATTGATTTTGATATTTAGCATTGTTAATCTTGGTTGGTGGATGCAAGAAATGACAATG 
TTGTATCTTGGAGTTGCTATTATATCGGCTTTTATTTGTAAATTAGGTGAAACTGAAATGTGGGATGCGTTTGTGA 
AAGGTTCTGAAAGTCTGCTAACCGCTGCTCTTGTTATTGGACTTGCTAGAGGTGTTATGATAGTATGTGATGATGG 
GTTGATTACAGATACTATGTTAAATGCTGCTACTAATTTTTTATACAATCTTCCAAGACCCCTTTTTATCATATTG 
AATGAAATTATTCAAATATTTATAGGATTTGTTGTTCCATCTTCATCAGGACATGCTAGTCTCACTATGCCAATAA 
TGGCTCCTCTTGCCGATTTTTTGTCAATTCCAAGAGCTTCAGTTGTTATTGCCATGCAGACTGCATCTGGGCTTAT 
TT^TTTGATAACACCTACCAGCGGAGTTATAATGGCTGTATTGGGGATATCCAGATTGAGTTATGGTACGTGGTTT 
AAGTTTGTTTTACCATTATTTATGATTGAGTTTTTTATCTCTATTTTAGTTATTATAGCTAACATTTATTTAAGTT 
TTTAG 



t4 ,nt 



AAGTTTGATAAAGAATTTAAGCAAATGGGTGATGGATCTAAAAGGGAAATAATTGTTGCTGGAACTTATCAATATG 
•TAGATCGAGGCTCTAGGGGATTTTTACATCCTATTATGACTATTTTAACCGCAATGTCAAAGGGGATGGAACATGC 
AGTTGAAGTTATTGTTTTTGTTTTAATTGTTGGGGGTGCTTATGGGATTATTATGAAAACTGGAGCAATAGATGTG 
GGAATTTATTTTTTAATCAAGAAGTTGGGGCACAAAGATAAGTTGCTTATTCCTTTGTTAATGTTTATTTTTTCAA 
TTGGTGGAACTGTAACCGGAATGAGTGAAGAGACCCTTCCTTTTTATTTTGTTATGATTCCCTTGATAGTAGCTTT 
GGGTTATGATAGTCTTGTTGGAGCGGCTATTATTGCTTTAGGAGCTGGAGTGGGAACTATGGCTTCTACTGTAAAT 
CCATTTGCGACAGGAATTGCATCTGCAATAGCTTCTATTAGCTTGCAGGATGGATTTTATTTTAGAATTGTTCTTT 
ATTTTGTATCAGTATTGGCTGCTATAACCTATGTTTGTGTTTATGCGTCTAAAATTAAAAAGGATCCCTCAAAATC 
GCTTGTGTATTCTCAAAAAGATGAACATTATCAATATTTTGTTAAAAAAGATGGACTTTCTACCGGAGATAATGCT 
CAGAATGCTCTTGAGTTTACTTTTGCTCATAAATTAGTTTTACTTTTATTTGGATTTATGATATTGATTT^ 
TTAGCATTGTTAATCTTGGTTGGTGGATGCAAGAAATGACAATGTTGTATCTTGGAGTTGCTATTATATCGGCTT^ 
TATTTGTAAATTAGGTGAAACTGAAATGTGGGATGCGTTTGTGAAAGGTTCTGAAAGTCTGCTAACCGCTGCTCT^ 
GTTATTGGACTTGCTAGAGGTGTTATGATAGTATGTGATGATGGGTTGATTACAGATACTATGTTAAATGCTGCTA 
CTAATTTTTTATACAATCTTCCAAGACCCCTTTTTATCATATTGAATGAAATTATTCAAATATTTATAGGATTTGT 
TGTTCCATCTTCATCAGGACATGCTAGTCTCACTATGCCAATAATGGCTCCTCTTGCCGATTTTTTGTCAATTCCA 
AGAGCTTCAGTTGTTATTGCCATGCAGACTGCATCTGGGCTTATTAATTTGATAACACCTACCAGCGGAGTTATAA 
TGGCTGTATTGGGGATATCCAGATTGAGTTATGGTACGTGGTTTAAGTTTGTTTTACCATTATTTATGATTGAGTT 
TTTTATCTCTATTTTAGTTATTATAGCTAACATTTATTTAAGTTTTTAG 
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f43 .aa 



MKYFYFLFFLLIFNVYAQNVNSPALPSPPLLPEITENKPVERENSSKGENFSNVGLDGKYVNDTILYGLDSQVTSI 

IKALKKSSDSQYNFSLKKRLEKTFNAELKREILELFISLKYSGGIDTANYILENYESKRYSNALFGLAISYLKEFD 

DKEKLKKTLIDILENKEGNWSIAAYYLGELNSLEYSKNMMEVFEKYSGNIX5ARREILIALGKM 

SLDNYEGPSIKAAAIEALSYLASDKVTENADLYLQSNNNNLNVKLAIIASLSKDPSLKSKEILQGFLRDSDD 

KAINAIKGHRDSSAKDILIYKLKSDPSLKVREASAKALIDMDLGNIEIKNIMFDFKIDNNFKISMFSYLLDKDSLK 

ALSIALEIVNKENINRPSIWLRGVASMLAGKKGNFDNFYSKIIDSKNIDLRHLALKGAVYNKSSSLSDKLKKIKSE 

TNSEYIKMLLKDY 



t43 .aa 



LPSPPLLPEITENKPVERENSSKGENFSNVGLDGKYVNDTILYGLDSQVTSIIKALKKSSDSQYNFSLKKRLEKTF 

NAELKREILELFISLKYSGGIDTANYILENYESKRYSNALFGLAISYLKEFDDKEKLKKTLIDILENKEGNWSIA 

AYYLGELNSLEYSKNMMEVFEKYSGNIXSARREILIALGKMSAVDYQDRIYEISLDNYEGPSIKAAAIEA^ 

KVTENADLYLQSNNNNLWKLAIIASLSKDPSLKSKEILQGFLRDSDDNIRFKAINAIKGHRDSSAKDILIYKLKS 

DPSLKWEASAKALIDMDLGNIEIKNIMFDFKIDNNFKISMFSYIiLDKDSLKALSIALEIVNKENINRPSNVLRG^ 

ASMLAGKKGNFDNFYSKIIDSKNIDLRHLALKGAVYNKSSSLSDKLKKIKSETNSEYIKMLLKDY 



f43 .nt 



ATGAAATACTTTTATTTTTTATTTTTTTTACTTATTTTTAATGTGTATGCTCAAAATGTTAATTCTCCAGCTCTTC 

CTAGTCCGCCTTTGTTGCCCGAAATTACAGAAAATAAGCCTGTTGAGAGAGAAAATTCTTCTAAGGGAGAGAATTT 

TTCTAATGTTGGTTTAGATGGTAAGTATGTTAACGATACAATTCTTTATGGGCTTGATAGTCAAGTGACAAGCATT 

ATAAAAGCTCTTAAAAAATCAAGCGATAGTCAATATAATTTTTCTCTTAAAAAAAGACTTGAGAAAACTTTTAATG 

CTGAGCTTAAAAGGGAAATACTTGAATTGTTTATTTCTCTTAAGTATTCGGGGGGCATTGATACAGCAAATTATAT 

TCTTGAAAATTATGAGAGTAAAAGATATTCAAACGCTTTATTTGGCTTGGCAATTTCGTATCTTAAGGAGTTTGAT 

GATAAAGAAAAATTAAAAAAAACTCTTATTGACATTCTTGAAAATAAAGAGGGCAATGTGGTATCTATTGCAGCTT 

ATTATTTAGGAGAGCTTAATTCTCTTGAGTATTCTAAAAACATGATGGAAGTTTTTGAAAAATATTCTGGAAATGA 

TGGGGCTAGAAGAGAAATACTTATTGCTCTTGGAAAAATGTCCGCTGTTGATTATCAGGATAGAATTTATGAAATT 

TCGCTAGATAATTACGAGGGCCCATCAATTAAGGCTGCTGCAATCGAAGCGTTGTCATATCTTGCTTCAGATAAAG 

TAACTGAAAATGCTGATTTGTATCTTCAGAGTAATAACAATAATTTAAATGTTAAATTAGCTATTATTGCTTCTTT 

GTCCAAAGATCCTTCTTTAAAGTCTAAAGAGATTTTACAAGGATTTTTAAGAGATTCTGATGATAATATTAGGTTT 

AAAGCTATTAATGCAATCAAAGGACATAGGGACTCTTCTGCAAAGGATATTTTGATTTATAAGCTTAAAAGCGATC 

CATCTCTTAAAGTTAGGGAGGCTTCTGCTAAGGCCTTAATTGATATGGATCTTGGGAATATTGAGATAAAAAA^ 

TATGTTTGATTTTAAGATTGACAATAATTTTAAAATTTCAATGTTTAGTTACCTTTTAGATAAGGATTCTCTAAAA 

GCATTGTCAATTGCTTTAGAAATTGTTAATAAAGAAAATATTAATAGACCCTCAAATGTTTTAAGGGGCGTTGCTT 

CAATGTTGGCTGGTAAAAAGGGTAATTTTGATAATTTTTATTCTAAAATCATTGACAGCAAAAATATTGATTTAAG 

GCATTTAGCATTAAAAGGAGCTGTTTATAATAAATCTTCATCGCTTTCTGATAAGCTTAAAAAAATTAAAAGTGAA 

ACGAACTCCGAATATATTAAAATGCTTTTAAAAGATTATTGA 



t43 .nt 



CTTCCTAGTCCGCCTTTGTTGCCCGAAATTACAGAAAATAAGCCTGTTGAGAGAGAAAATTCTTCTAAGGGAGAGA 
ATTTTTCTAATGTTGGTTTAGATGGTAAGTATGTTAACGATACAATTCTTTATGC^ 

CATTATAAAAGCTCTTAAAAAATCAAGCGATAGTCAATATAATTTTTCTCTTAAAAAAAGACTTGAGAAAACTTTT 

AATGCTGAGCTTAAAAGGGAAATACTTGAATTGTTTATTTCTCTTAAGTATTCGGGGGGCATTC 

ATATTCTTGAAAATTATGAGAGTAAAAGATATTCAAACGCTTTATTTGGCTTGGCAATTTCGTATC 
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TGATGATAAAGAAAAATTAAAAAAAACTCTTATTGACATTCTTGAAAATAAAGAGGGCAATGTGGTATCTATTGCA 
GCTTATTATTTAGGAGAGCTTAATTCTCTTGAGTATTCTAAAAACATGATGGAAGTTTTTGAAAAATATTCTGGAA 
ATGATGGGGCTAGAAGAGAAATACTTATTGCTCTTGGAAAAATGTCCGCTGTTGATTATCAGGATAGAATTTATGA 
AATTTCGCTAGATAATTACGAGGGCCCATCAATTAAGGCTGCTGCAATCGAAGCGTTGTCATATCTTGCTTCAGAT 
AAAGTAACTGAAAATGCTGATTTGTATCTTCAGAGTAATAACAATAATTTAAATGTTAAATTAGCTATTATTGCTT 
CTTTGTCCAAAGATCCTTCTTTAAAGTCTAAAGAGATTTTACAAGGATTTTTAAGAGATTCTGATGATAATATTAG 
GTTTAAAGCTATTAATGCAATCAAAGGACATAGGGACTCTTCTGCAAAGGATATTTTGATTTATAAGCTTAAAAGC 
GATCCATCTCTTAAAGTTAGGGAGGCTTCTGCTAAGGCCTTAATTGATATGGATCTTGGGAATATTGAGATAAAAA 
ACATTATGTTTGATTTTAAGATTGACAATAATTTTAAAATTTCAATGTTTAGTTACCTTTTAGATAAGGATTCTCT 
AAJ\AGCATTGTCAATTGCTTTAGAAATTGTTAATAAAGAAAATATTAATAGACCCTCAAATGTTTTAAGGGGCGTT 
GCTTCAATGTTGGCTGGTAAAAAGGGTAATTTTGATAATTTTTATTCTAAAATCATTGACAGCAAAAATATTGATT 
TAAGGCATTTAGCATTAAAAGGAGCTGTTTATAATAAATCTTCATCGCTTTCTGATAAGCTTAAAAAAATTAAAAG 
TGAAACGAACTCCGAATATATTAAAATGCTTTTAAAAGATTATTGA 



fSO.aa 



MKFVLNNLFKGCLICFFLFFSCLTTDRSIQDSHISDIVEKKKEAVIIDDNNWLGSNEGKFKRDYLIGLKDNESFF 

LSDAFLKENNFYFKKARESYAKKNIGLTNYYLNKIVTNENQHSRELIJUCANLFFGYVNYENGFYDLSEY^ 

DYKYSHASLRLAELKYLVKEKSDAISAFKEINEFSISGYDREIYGFLSNKLGVSHLNLESLGFLDNSVFDTFVFND 

NIFVTNILGGLLRYNIKKiroCRVYLKDKKSIFLNGIRGFADYNGTIYIGGKiaVWYIDDV^ 

WQVLLAVKNGIFVGTLNSGLWFYDLKNWKKIPLGSNKISSLCFDSLKNLLLVGTVDKAIYSV^^ 

FSKIODNEKNINFIKEYKDSYFVGTYGGGLFELNLNKNSYKKHVIANNIDVNYFI^MEIKDKKLLFATFD^^ 

SENDNWDYFGPNNGLLNLNLIKVSRFENYVILGTINNGLVFVDENIKKQL 



t50 .aa 



CLTTDRSIQDSHISDIVEKKKEAVIIDDNNWIiGSNEGKFKRDYLIGLKDNESFFLSDAFLKENNFYFKKARESYA 
KKNIGLTNYYLNKIVTNENQHSRELIJUCANLFFGYVNYENGFYDLSEYNFDIiFLKDYK^ 

SDAISAFKEINEFSISGYDREIYGFLSNKLGVSHLNLESLGFLDNSVFDTFVFNDNIFVTNILGGLLRYNIKKNDC 

RVYLKDKKSIFLNGIRGFADYNGTIYIGGKNVVYYIDDVDGDLKQIIWPGNADFSNVQVLLAVKNGIFVGT 

WFYDLKNWKNIPLGSNKISSLCFDSLKJSILLLVGTVDKAIYSVNVDNLKKIEHLDFFS^ 

VGTYGGGLFELNLNKNSYKKHVIANNIDVlSry'FMDMEIKDKKLLFATFDHGLLIYDSENDIWDYFG^ 

KVSRFENYVILGTINNGLVFVDENIKKQL 



fSO.nt 



atgaaatttgttttgaataatttatttaaaggttgtcttatatgttttttcttgtttttttcctgccttact^ 

atagatctattcaagattctcatattagtgatattgtagagaagaaaaaagaagcagtcattattgatgat;^ 

tgttgttcttgggagtaatgagggtaaatttaaaagagactatttgataggattaaaagataatgaatcttttttt 

cttagtgatgcttttttaaaagaaaataatttttattttaaaaaagccagggaaagttatgctaaaaaaaatattg 

gcttgacaaattattatttgaataaaatagtaactaatgagaatcagcacagcagagaattgctagctaaagcgaa 

tttgttttttggatatgtaaattatgagaatggtttttatgatctttccgaatataattttgatctatttttaaaa 

gactataaatattctcatgctagtttaagattagctgaattaaaatatcttgttaaagaaaaatctgatgcaattt 

ctgcatttaaagagattaatgaattttctatctcaggttatgatagagagatit'atggctttttaagtaata;^ 

tggagtaagtcatttaaacttagagtctttaggatttcttgacaacagcgtttttgatacatttgtctttaatgac 

aatatatttgtaactaatatattgggagggcttttaagatataatattaaaaaaaatgattgtagagtctatctta 

aggataaaaaaagcatttttttaaatggcattaggggttttgcggattataatggaacaa^ 

aaatgttgtttattatatagatgatgttgatggggatttaaagcaaataaatgttc 

aatgtacaagttttgcttgctgttaaaaatggaatatttgttggcactctaaa™ 

taaaaaattggaaaaatataccgcttggatctaataaaatttcttcactctgctttgatag 
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ATTAGTTGGAACAGTTGACAAGGCTATTTATAGTGTTAATGTCGATAATTTGAAAAAGATTGAACATTTGGATTTT 

TTTAGCAAAAATGATAATGAAAAAAATATTAATTTTATAAAAGAATATAAAGATAGTTATTTTGTTGGAACATATG 

GTGGGGGTCTTTTTGAATTAAATTTAAATAAAAATAGTTACAAAAAGCACGTTATTGCCAATAATATTGATGTTAA 

TTATTTTATGGATATGGAGATTAAAGATAAAAAGCTATTGTTTGCAACCTTTGATCATGGGTTATTGATTTATGAT 

TCTGAAAATGACAACTGGGATTATTTTGGACCCAATAATGGGCTTCTTAATTTGAATTTAATAAAAGTTTCTAGAT 

TTGAAAATTATGTCATACTGGGCACTATTAATAACGGTTTGGTTTTTGTAGATGAAAATATTAAAAAACAGT^^ 

A 



tSO .nt 



TGCCTTACTACAGATAGATCTATTCAAGATTCTCATATTAGTGATATTGTAGAGAAGAAAAAAGAAGCAGTCATTA 
TTGATGATAATAATGTTGTTCTTCGGAGTAATGAGGGTAAATTTAAAAGAGACTATTTGATAGGATTAAAAGATA^ 
TGAATCTTTTTTTCTTAGTGATGCTTTTTTAAAAGAAAATAATTTTTATTTTAAAAAAGCCAGGGAAAGTTATGCT 
AAAAAAAATATTGGCTTGACAAATTATTATTTGAATAAAATAGTAACTAATGAGAATCAGCACAGCAGAGAATTGC 
TAGCTAAAGCGAATTTGTTTTTTGGATATGTAAATTATGAGAATGGTTTTTATGATCTTTCCGAATATAATTTTGA 
TCTATTTTTAAAAGACTATAAATATTCTCATGCTAGTTTAAGATTAGCTGAATTAAAATATCTTGTTAAAGAAAAA 
TCTGATGCAATTTCTGCATTTAAAGAGATTAATGAATTTTCTATCTCAGGTTATGATAGAGAGATTTATGGCTTTT 
TAAGTAATAAACTTGGAGTAAGTCATTTAAACTTAGAGTCTTTAGGATTTCTTGACAACAGCGTTTTTGATACATT 
TGTCTTTAATGACAATATATTTGTAACTAATATATTGGGAGGGCTTTTAAGATATAATATTAAAAAAAATGATTGT 
AGAGTCTATCTTAAGGATAAAAAAAGCATTTTTTTAAATGGCATTAGGGGTTTTGCGGATTATAATGGAACAATTT 
ATATTGGTGGTAAAAATGTTGTTTATTATATAGATGATGTTGATGGGGATTTAAAGCAAATAAATGTTCCCGGTAA 
TGCTGATTTTAGCAATGTACAAGTTTTGCTTGCTGTTAAAAATGGAATATTTGTTGGCACTCTAAATTCTGGATTA 
TGGTTTTATGATTTAAAAAATTGGAAAAATATACCGCTTGGATCTAATAAAATTTCTTCACTCTGCTTTGATAGTT 
TAAAAAATTTATTATTAGTTGGAACAGTTGACAAGGCTATTTATAGTGTTAATGTCGATAATTTGAAAAAGATTGA 
ACATTTGGATTTTTTTAGCAAAAATGATAATGAAAAAAATATTAATTTTATAAAAGAATATAAAGATAGTTATTTT 
GTTGGAACATATGGTGGGGGTCTTTTTGAATTAAATTTAAATAAAAATAGTTACAAAAAGCACGTTATTGCCAATA 
ATATTGATGTTAATTATTTTATGGATATGGAGATTAAAGATAAAAAGCTATTGTTTGCAACCTTTGATCATGGGTT 
ATTGATTTATGATTCTGAAAATGACAACTGGGATTATTTTGGACCCAATAATGGGCTTCTTAATTTGAATTTAATA 
AAAGTTTCTAGATTTGAAAATTATGTCATACTGGGCACTATTAATAACGGTTTGGTTTTTGTAGATGAAAATATTA 
AAAAACAGTTATGA 



f65.aa 

MHIFKNVPFQINLILFLLVSVAKINASSKFYYAEQWYVIFNSQMKKKPENYKKNIFFLQKALKYPFGNPKYSLTKI 

ETKEQWEKYKLLFKMHVNLLLWQNLHLGDLFOTRISrLYFFKTPEKDGIISNLEKSKKLYKLAIN^ 

ENYTTVKLENDGITNWEDEYHKISLKELNYYDIIKKELLRIDETKAFFEQGPNYY 

t65 .aa 

KINASSKFYYAEQWYVIFNSQMKKKPENYKKNIFFLQKALKYPFGNPKYSLTKIETKEQWEKYKLLFKl^^ 

RQNLHLGDLFDTRNLYFFKTPEKDGIISNLEKSKKLYKIAINYYSEALKYHKKLEiry'TTVKLENrX^ 

ISLKELNYYDIIKKELLRIDETKAFFEQGPNYY 

f65.nt 

ATGCATATTTTCAAAAATGTCCCCTTCCAAATAAATTTAATTTTATTTCTTTTAGTATCAGTTGCAAAGATAAATG 

CATCGTCCAAATTTTATTACGCAGAACAATGGTATGTAATTTTTAATTCTCAAATGAAAAAAAAACCTGAAAACTA 

TAAAAAAAATATATTTTTTCTTCAAAAAGCCTTAAAATACCCATTTGGAAATCCAAAATATTCTCTAACTAAAATA 

GAAACCAAAGAACAGTGGGAAAAATATAAACTTCTTTTCAAAATGCATGTAAACTTGCTTCTAGTTAGGCAAAATT 

TACATTTAGGAGATTTATTCGACACAAGAAATTTATATTTTTTCAAAACTCCAGAAAAAGATGGAA^ 

TCTAGAAAAATCAAAAAAATTATATAAACTAGCTATTAATTACTACAGCGAAGCACTAAAATACCACAAAAAACTT 

GAAAATTACACAACTGTTAAACTAGAAAACGATGGAATAACAAACTGGGAAGATGAATATCATAAAATTTCTCT^ 
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AAGAGCTTAATTACTATGACATTATTAAAAAAGAACTACTAAGAATTGACGAAACTAAAGCATTTTTTGAACAAGG 
GCCAAACTATTATTAA 

t65 .nt 

KINASSKFYYAEQWWIFNSQMKKKPENYKKNIFFLQKALKYPFGNPKYSLTKIETKEQWEKYKLLFKMHW 
RQNLHLGDLFDTRNLYFFKTPEKDGIISNLEKSKKLYKLAI^^!fYSEALKyHKKLENYTTVKLENDGITNWEDEra 
ISLKELNYYDI IKKELLRIDETKAFFEQGPNYY 



f 8 .aa 



MKNINRLILLILTTHTLLFSCALI7a:>NKSKNLSTSEIILTQKTLLESSLIKNPSlWEYRIPISSIQEILlS^^ 
LIKKTAAKIKISPQKLEEIKNYLNAYKlSnfLlSnSfETEWIKFIDQSSVNGNLTI 

lELQMHLEKEILNLIEQTFHDKNLGYIQLSHINSFFPQENINSITKEIIDGKEYIAPHIIANQLLKIKDKKYFEQF 
MHFLKVENSKIKTIIEKQKISDLHNELYYSKQSPPRRRKRSTADSDNNNKYDIIPKIIDPNTGIEITPKNLRSILS 
NGDIILIKPKIDWTEFFYFWQHVGIFDEEKYEATKKIAFNGIDSFDIKSIITSNQIKFDTASTQGSGYEKLSTYVQ 
SRILKIFSPITDIRTIQKAINFGRSRYIDNNFGYMVPLISSNLWTDSFNLEEIHNKTYCSLMVDRIYKIAGLNVSR 
NYEISGIITPGEINAAAYNFYMSYTIAGILPSVLPKRLIKPTLKEKFIGYNKEIVDAIELKKSKEKIFGRACNITN 
LWCSGS 



t8 .aa 



CALIADNKSKNLSTSEIILTQKTLLESSLIKNPSNVEYRIPISSIQEILNNNNDSFLIKKTAAKIKISPQKLEEIK 

NYLNAYKNYLNNETEWIKFIDQSSVNGNLTIKIDTAFEKKTNFNHTNSDNENLTELIELQMHLEKEILNLIEQTFH 

DKNLGYIQLSHINSFFPQENINSITKEIIDGKEYIAPHIIANQLLKIKDKKYFEQFMHFLKVENSKIKTIIEKQKI 

SDLHNELYYSKQSPPRRRKRSTADSDNNNKYDIIPKIIDPNTGIEITPKNLRSILSNGDIILIKPKIDWTEFFYFW 

QHVGIFDEEKYEATKKIAFNGIDSFDIKSIITSNQIKFDTASTQGSGYEKLSTYVQSRILKIFSPITDIRTIQKAI 

NFGRSRYID^^S^FGYMVPLISSNLWTDSFNLEEIHNKTYCSLMVDRIYKIAGL^rVSRNYEISGIITP 

YMSYTIAGILPSVLPKRLIKPTLKEKFIGYNKEIVDAIELKKSKEKIFGRACNITNLWCSGS 



f 8 .nt 



ATGAAGAATATTAATAGATTAATATTATTAATATTAACTACACACACTTTATTATTCTCTTGTGCCTTAATTGCAG 

ATAATAAGTCAAAAAATTTAAGCACATCAGAAATCATATTAACACAAAAAACACTACTAGAAAGCTCTTTAATAAA 

AAATCCTTCTAATGTAGAATATCGAATACCAATATCCAGTATCCAAGAAATTTTAAACAATAACAATGATTCTTTT 

TTAATAAAAAAAACAGCAGCAAAAATCAAAATAAGCCCTCAAAAACTTGAAGAAATAAAAAACTATCTAAATGCTT 

ATAAAAATTATCTAAATAATGAAACAGAATGGATAAAGTTTATAGATCAAAGTAGCGTCAATGGAAATTTAACAAT 

TAAAATTGATACTGCTTTTGAAAAAAAAACAAATTTOAATCATACAAATTCAGATAATGAAAATTTAACAGAACTA 

ATAGAACTACAAATGCATCTGGAAAAAGAAATTTTAAACTTAATTGAGCAAACATTTCATGATAAAAAT 

ATATACAATTAAGTCACATCAACTCATTCTTTCCTCAAGAAAATATAAACTCAATAACAAAAGAAATAATAGATGG 

AAAAGAATATATTGCACCGCACATAATAGCAAATCAATTATTAAAAATAAAAGATAAAAAATATTTTGAACAATTT 

ATGCACTTTTTAAAAGTTGAAAACAGCAAAATAAAAACAATAATTGAAAAACAAAAAATTTCAGATCTTCACAATG 

AACTGTATTATTCAAAACAATCCCCGCCCAGAAGAAGAAAAAGGTCAACTGCCGATTCCGATAATAACAATAAATA 

CGATATAATACCAAAAATAATAGACCCAAATACAGGCATTGAAATAACTCCTAAAAATTTAAGATCTATTTTATCA 

AATGGCGACATAATACTAATAAAACCAAAAATAGATTGGACAGAATTTTTTTATTTTTGGCAACATGT^ 

TTGATGAAGAAAAATATGAAGCCACTAAAAAAATTGCATTCAATGGAATTGATAGCTTTGATATAAAATCA^ 

TACAAGCAATCAAATCAAATTCGATACAGCATCTACTCAAGGTTCAGGATACGAAAAGCTTTCAACATACGTACAA 

TCAAGAATATTAAAAATATTCTCACCAATAACAGACATAAGAACAATTCAAAAAGCTATTAATTTTGGA^ 

GATACATTGACAATAACTTTGGATATATGGTTCCATTAATATCCTCTAATTTATGGACAGATTCATTC^ 

AGAAATTCACAACAAAACCTATTGCTCTTTAATGGTTGATAGAATATATAAAATAGCAGGACTTAATC 
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AATTACGAAATTTCGGGAATAATTACTCCTGGAGAAATAAATGCAGCAGCTTACAATTTTTACATGTCTTATACGA 
TTGCAGGAATACTTCCAAGCGTGCTTCCAAAAAGGCTCATTAAACCAACATTAAAAGAAAAATTCATTGGTTACAA 
TAAAGAAATAGTAGATGCAATAGAATTAAAAAAATCGAAAGAAAAAATTTTTGGGAGAGCTTGCAACATTACAAAT 
CTCTGGTGCTCAGGAAGTTAA 



te.nt 



TGTGCCTTAATTGCAGATAATAAGTCAAAAAATTTAAGCACATCAGAAATCATATTAACACAAAAAACACTACTAG 
AAAGCTCTTTAATAAAAAATCCTTCTAATGTAGAATATCGAATACCAATATCCAGTATCCAAGAAATTTTAAACAA 
TAACAATGATTCTTTTTTAATAAAAAAAACAGCAGCAAAAATCAAAATAAGCCCTCAAAAACTTGAAGAAATAAAA 
AACTATCTAAATGCTTATAAAAATTATCTAAATAATGAAACAGAATGGATAAAGTTTATAGATCAAAGTAGCGTCA 
ATGGAAATTTAACAATTAAAATTGATACTGCTTTTGAAAAAAAAACAAATTTTAATCATACAAATTCAGATAATGA 
AAATTTAACAGAACTAATAGAACTACAAATGCATCTGGAAAAAGAAATTTTAAACTTAATTGAGCAAACATTTCAT 
GATAAAAATTTAGGATATATACAATTAAGTCACATCAACTCATTCTTTCCTCAAGAAAATATAAACTCAATAACAA 
AAGAAATAATAGATGGAAAAGAATATATTGCACCGCACATAATAGCAAATCAATTATTAAAAATAAAAGATAAAAA 
ATATTTTGAACAATTTATGCACTTTTTAAAAGTTGAAAACAGCAAAATAAAAACAATAATTGAAAAACAAAAAATT 
TCAGATCTTCACAATGAACTGTATTATTCAAAACAATCCCCGCCCAGAAGAAGAAAAAGGTCAACTGCCGATTCCG 
ATAATAACAATAAATACGATATAATACCAAAAATAATAGACCCAAATACAGGCATTGAAATAACTCCTAAAAATTT 
AAGATCTATTTTATCAAATGGCGACATAATACTAATAAAACCAAAAATAGATTGGACAGAATTTTTTTATTTTTGG 
CAACATGTGGGAATATTTGATGAAGAAAAATATGAAGCCACTAAAAAAATTGCATTCAATGGAATTGATAGCTTTG 
ATATAAAATCAATAATTACAAGCAATCAAATCAAATTCGATACAGCATCTACTCAAGGTTCAGGATACGAAAAGCT 
TTCAACATACGTACAATCAAGAATATTAAAAATATTCTCACCAATAACAGACATAAGAACAATTCAAAAAGCTATT 
AATTTTGGAAGAAGTAGATACATTGACAATAACTTTGGATATATGGTTCCATTAATATCCTCTAATTTATGGACAG 
ATTCATTCAATCTTGAAGAAATTCACAACAAAACCTATTGCTCTTTAATGGTTGATAGAATATATAAAATAGCAGG 
ACTTAATGTATCAAGAAATTACGAAATTTCGGGAATAATTACTCCTGGAGAAATAAATGCAGCAGCTTACAATTTT 
TACATGTCTTATACGATTGCAGGAATACTTCCAAGCGTGCTTCCAAAAAGGCTCATTAAACCAACATTAAAAGAAA 
AATTCATTGGTTACAATAAAGAAATAGTAGATGCAATAGAATTAAAAAAATCGAAAGAAAAAATTTTTGGGAGAGC 
TTGCAACATTACAAATCTCTGGTGCTCAGGAAGTTAA 



f 82 .aa 



MTRVFSKFFLFFCFSMLLFANSEDSNEKDIVSKDENPVFENEVLGYWVGYNDVSNIKNSIIYIYKYNGEVYGRILT 
IIKDGKKYDAKNPSGDTWGFENLAIEGLDFMWGLKYSSSSKKWDRGKIIDPKNGKIYNSEMRVDSKTGNLITKGK 
VWIFGRSKIWTRAKDDEIPKLDLHNLVPAPPVKK 



t82 .aa 



EDSNEKDIVSKDENPVFENEVLGYWVGYNDVSNIKNSIIYIYKYNGEVYGRILTIIKDGKKYDAKNPSGDTWGFE 
NIAIEGLDFMWGLKYSSSSKKWDRGKIIDPKNGKIYNSEMRVDSKTGNLITKGKWIFGRSKIWTRAKDDEIPKLD 
LHNLVPAPPAnCK 



f82nt 



ATGACTAGAGTTTTTTCAAAGTTTTTTCTTTTTTTTTGTTTTTCAATGCTTTTATTTGCAAATTCAGAAGATTCAA 

ATGAAAAGGACATTGTTAGCAAGGATGAAAACCCTGTTTTTGAAAATGAAGTTTTAGGATATTGGGTTGGTTAT^ 

TGATGTAAGTAACATAAAGAATTCTATTATCTATATTTATAAATATAATGGGGAAGTTTATGGCCGAATT^ 

ATAATAAAAGATGGCAAAAAGTATGATGCTAAAAATCCTTCAGGAGATACTGTAGTTGGGTTTC 

TAGAGGGTCTTGATTTTATGTGGGGTCTTAAGTATTCTTCTTCTTCTAAAAAGTG<^ 
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TCCTAAAAACGGTAAAATTTATAATTCTGAGATGCGTGTTGATAGTAAAACCGGAAATCTTATTACCAAGGGGAAA 
GTTTGGATTTTTGGTAGAAGTAAAATTTGGACAAGAGCTAAAGATGATGAAATACCAAAATTAGATTTGCATAATC 
TTGTTCCAGCGCCCCCTGTGAAAAAATAA 

f82 .nt 

GAAGATTCAAATGAAAAGGACATTGTTAGCAAGGATGAAAACCCTGTTTTTGAAAATGAAGTTTTAGGATATTGGG 
TTGGTTATAATGATGTAAGTAACATAAAGAATTCTATTATCTATATTTATAAATATAATGGGGAAGTTTATGGCCG 
AATTTTAACTATAATAAAAGATGGCAAAAAGTATGATGCTAAAAATCCTTCAGGAGATACTGTAGTTGGGTTTGAA 
AATCTTGCAATAGAGGGTCTTGATTTTATGTGGGGTCTTAAGTATTCTTCTTCTTCTAAAAAGTGGGATAGGGGCA 
AAATAATAGATCCTAAAAACGGTAAAATTTATAATTCTGAGATGCGTGTTGATAGTAAAACCGGAAATCTTATTAC 
CAAGGGGAAAGTTTGGATTTTTGGTAGAAGTAAAATTTGGACAAGAGCTAAAGATGATGAAATACCAAAATTAGAT 
TTGCATAATCTTGTTCCAGCGCCCCCTGTGAAAAAATAA 



f 86 .aa 



^^SrKLMl^LITFATSLLAQTNKASTGLKTDQSFNNSLSESVKLKEIADIYPTNTNFLTGIGIVAGLAGKGDSIKQKD 

LIIKILEENNIINEIGSNNIESKNIALVNVSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGEIIA 

lASGITQPNNKLKGSGYTIDSVIINENQNINHSYNIILKKGNYTLINRIHKILTSKKINNKIKSDSTIEIEAKNIS 

LLEEIENIKIETNPKILIDKKNGIILASENAKIGTFTFSIEKDNQNIFLSKJSnSTKTTIQVNSMKLNEFILKNSN^ 

NKELIQIIQAAQKINKLNGELILEEIDGNQN 



t8 6 .aa 



LKTDQSFNNSLSESVKLKEIADIYPTNTNFLTGIGIVAGLAGKGDSIKQKDLIIKILEENNIINEIGSNNIESKNI 

ALVNVSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGEIIAIASGITQPNNKLKGSGYTIDSVIIN 

ENQNINHSYNIILKKGNYTLINRIHKILTSKKINNKIKSDSTIEIEAKNISLLEEIENIKIETNPKILIDKKNGII 

LASENAKIGTFTFSIEKDNQNIFLSKJSnSTKTTIQWSMKLNEFILKNSNNLSNKELIQIIQAAQKINKL^ 

IDGNQN 

f86.nt 



ATGAACAAACTAATGTTGATGTTAATTACATTTGCAACGAGTCTATTAGCCCAAACAAACAAAGCTTCAACAGGAC 
TAAAAACAGATCAATCATTTAACAATAGCCTATCTGAAAGCGTAAAATTAAAAGAAATTGCGGATATTTATCCCAC 
AAATACAAATTTTTTAACAGGTATTGGAATAGTAGCGGGACTTGCTGGAAAAGGAGACTCTATAAAACAAAAAGAC 
CTTATAATTAAAATTTTAGAAGAAAACAATATAATAAATGAAATAGGCTCTAATAACATAGAAAGTAAAAATATTG 
CACTAGTAAATGTCAGTCTCCAAGTAAAAGGTAATACAATCAAAGGTTCAAAACATAAAGCTTGCGTTGCATCAAT 
ACTGGACTCAAAAGATTTAACAAATGGAATACTTTTAAAAACAAATCTTAAAAATAAAGAGGGGGAAATAATAGCA 
ATTGCATCAGGAATTACACAGCCCAATAATAAATTAAAAGGATCTGGATATACTATAGATAGTGTAATAATAAATG 
AGAATCAAAATATTAACCACAGTTATAATATAATTCTTAAAAAAGGAAATTATACATTAATAAATAGAATTCATAA 
AATATTAACCTCTAAAAAAATCAACAACAAAATTAAATCAGACAGCACAATAGAAATAGAAGCAAAAAACATAAGC 
CTATTAGAAGAGATTGAAAATATTAAAATAGAAACCAACCCCAAGATATTAATAGACAAAAAAAATGGTATTATTT 
TAGCAAGTGAAAATGCAAAAATAGGAACTTTTACATTTTCCATTGAAAAAGACAATCAAAACATTTTTTTAAGTAA 
AAATAACAAAACAACAATTCAAGTAAACTCAATGAAATTAAATGAATTTATATTAAAAAATTCCAACAATCTTAGC 
AATAAAGAATTAATTCAAATAATTCAAGCTGCGCAAAAAATTAATAAATTAAATGGGGAACTTATCTTGGAGGAAA 
TTGATGGAAACCAAAATTAA 



t86 .nt 
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CTAAAAACAGATCAATCATTTAACAATAGCCTATCTGAAAGCGTAAAATTAAAAGAAATTGCGGATATTTATCCCA 
CAAATACAAATTTTTTAACAGGTATTGGAATAGTAGCGGGACTTGCTGGAAAAGGAGACTCTATAAAACAAAAAGA 
CCTTATAATTAAAATTTTAGAAGAAAAC/^TATAATAAATGAAATAGGCTCTAATAACATAGAAAGTAAAAATATT 
GCACTAGTAAATGTCAGTCTCCAAGTAAAAGGTAATACAATCAAAGGTTCAAAACATAAAGCTTGCGTTGCATCAA 

tactggactcaaaagatttaacaaatggaatacttttaaaaacaaatcttaaaaataaagaggggga;^ 

AATTGCATCAGGAATTACACAGCCCAATAATAAATTAAAAGGATCTGGATATACTATAGATAGTGTAATAATAAAT 

gagaatcaaaatattaaccacagttataatataattcttaaaaaaggaaattatacattaataaatagaattcata 

AAATATTAACCTCTAAAAAAATCAACAACAAAATTAAATCAGACAGCACAATAGAAATAGAAGCAAAAAACATAAG 

cctattagaagagattgaaaatattaaaatagaaaccaaccccaagatattaatagacaaaaaaaatggtattatt 

TTAGCAAGTGAAAATGCAAAAATAGGAACTTTTACATTTTCCATTGAAAAAGACAATCAAAACATTTTTTTAAGTA 

aaaataacaaaacaacaattcaagtaaactcaatgaaattaaatgaatttatattaaaaaattccaacaatcttag 

CAATAAAGAATTAATTCAAATAATTCAAGCTGCGCAAAAAATTAATAAATTAAATGGGGAACTTATCTTGGAGGAA 
ATTGATGGAAACCAAAATTAA 



fSO.aa 



MCPITFTIPFFLAIFFAFSSSFVTDSSVSLLSRNTSLFSTLTPISLPIISGTLPAIVTLSKKYLSISLSFSKMIFI 
KSLFEVIKLPIWLFIIFASGYFLNAFSIFLCISSFLSFMFI 



t90 .aa 



SSFVTDSSVSLLSRNTSLFSTLTPISLPIISGTLPAIVTLSKKYLSISLSFSKMIFIKSLFEVIKLPIWLFIIFAS 
GYFLNAFSIFLCISSFLSFMFI 

f90.nt 



ATGTGTCCTATTACTTTTACCATTCCATTTTTTCTAGCAATATTTTTTGCTTTTTCAAGCTCCTTTGTTACGGACT 
CTTCTGTGTCTTTGCTATCAAGAAATACGTCTCTTTTTTCTACTTTAACTCCAATTTCTTTGCCTATTATTTCTGG 
TACGCTTCCTGCAATAGTTACGCTGTCGAAAAAATATCTGTCAATCTCTTTAAGCTTTTCTAAAATGATTTTCATC 
AAATCTTTATTTGAAGTGATTAAACTTCCCATATGGTTATTCATTATTTTTGCATCAGGATACTTTTTAAATGCTT 
TTTCGATTTTTTTGTGTATTTCTTCTTTTTTATCTTTTATGTTTATATGA 



t90 .nt 



AGCTCCTTTGTTACGGACTCTTCTGTGTCTTTGCTATCAAGAAATACGTCTCTTTTTTCTACTTTAACTCCAATTT 
CTTTGCCTATTATTTCTGGTACGCTTCCTGCAATAGTTACGCTGTCGAAAAAATATCTGTCAATCTCTTTAAGCTT 
TTCTAAAATGATTTTCATCAAATCTTTATTTGAAGTGATTAAACTTCCCATATGGTTATTCATTATTTTTGCATCA 
GGATACTTTTTAAATGCTTTTTCGATTTTTTTGTGTATTTCTTCTTTTTTATCTTTTATGTTTATATGA 



f469.aa 

MANVALSSGFISQKIFGIIIIIWFLPTIIATPIINFLFKINKSGLKKELPIDQIOTHICVSFEYDNI^ILIWDFKN 

ELRKEGFFTQQIKm)SSQYINARKNNISFSIKREGSKITFECPNNHLIIIQDLFRETIUSrLEKITKEVEWSLI^ 

KJJDYSINYDKILSNINLNKRIKKENIILELKSSNKADVIRELLSVINIEIDKERIFQDLMEREKLITTALKEG 

PHLKTNLISKIHIAIGISHEGIDFNALDKNLSHVFILILCPAKDYVSYPRILASWGKVDLYKKEILNAKTDKEIY 

NIIVS2 



t469 .aa 
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VFLPTIIATPIINFLFKINKSGLKKELPIDQlSrrHICVSFEYDNIJUCILIWDFKNELRKEGFFTQQIKNDSSQyiNA 
RKNNISFSIKREGSKITFECPNNHLIIIQDLFRETILNLEKITKEVETVSLRAKKLDYSINYDKILSNINLNKRIK 
KENIILELKSSNKADVIRELLSVINIEIDKERIFQDLMEREKLITTALKEGFAIPHLKTNLISKIHIAIGISHEGI 
DFNALDKNLSHVF I L I LC PAKD YVS YPR I LiAS WGKVDL YKKE I LNAKTDKE I YN 1 1 VS Z 

f469.nt 

ATGGCAAATGTAGCATTATCTTCAGGATTTATTAGCCAAAAAATATTTGGAATCATAATAATAATGGTGTTTTT 

CAACAATCATTGCAACACCCATAATAAACTTTTTATTTAAAATAAATAAAAGTGGACTTAAAAAAGAACTCCCAAT 

AGATCAAAATACACACATATGCGTATCATTTGAATATGATAATTTAGCCAAAATTCTTATATGGGACTTTAAAAAT 

GAGTTAAGAAAAGAAGGATTTTTTACACAACAAATTAAAAATGATTCTTCACAATATATTAATGCAAGAAAAAACA 

ATATATCCTTCTCAATAAAACGAGAAGGTAGCAAAATCACATTTGAATGCCCAAATAATCATTTAATTATAATACA 

AGATCTTTTTAGAGAAACAATCTTAAACCTAGAAAAAATAACCAAAGAAGTTGAAACAGTCTCTTTAAGAGCAAAA 

AAACTAGATTACTCAATAAATTACGATAAAATCCTTAGTAATATCAACCTAAATAAAAGAATAAAAAAGGAAAACA 

TTATTCTAGAATTAAAATCAAGCAATAAGGCTGATGTAATAAGAGAGCTTCTAAGCGTAATAAACATTGAAATTGA 

TAAAGAAAGAATATTCCAAGATTTAATGGAAAGAGAAAAGTTAATTACTACTGCACTAAAAGAAGGCTTTGCCATT 

CCCCATTTAAAAACAAATTTAATATCAAAAATACATATTGCAATAGGAATAAGCCATGAGGGAATTGACTTTAATG 

CTCTTGACAAGAACTTAAGTCATGTTTTTATATTAATACTGTGCCCAGCAAAAGATTACGTTAGCTACCCTAGAAT 

TTTAGCATCTGTTGTGGGCAAAGTTGATCTGTACAAAAAAGAAATTTTAAATGCAAAAACAGATAAAGAAATTTAT 

AATATAATAGTGAGCTAA 

t469.nt 

TTTTTGCCAACAATCATTGCAACACCCATAATAAACTTTTTATTTAAAATAAATAAAAGTGGACTTAAAAAAGAAC 
TCCCAATAGATCAAAATACACACATATGCGTATCATTTGAATATGATAATTTAGCCAAAATTCTTATATGGGACTT 
TAAAAATGAGTTAAGAAAAGAAGGATTTTTTACACAACAAATTAAAAATGATTCTTCACAATATATTAATGCAAGA 
AAAAACAATATATCCTTCTCAATAAAACGAGAAGGTAGCAAAATCACATTTGAATGCCCAAATAATCATTTAATTA 
TAATACAAGATCTTTTTAGAGAAACAATCTTAAACCTAGAAAAAATAACCAAAGAAGTTGAAACAGTCTCTTTAAG 
AGCAAAAAAACTAGATTACTCAATAAATTACGATAAAATCCTTAGTAATATCAACCTAAATAAAAGAATAAAAAAG 
GAAAACATTATTCTAGAATTAAAATCAAGCAATAAGGCTGATGTAATAAGAGAGCTTCTAAGCGTAATAAACATTG 
AAATTGATAAAGAAAGAATATTCCAAGATTTAATGGAAAGAGAAAAGTTAATTACTACTGCACTAAAAGAAGGCTT 
TGCCATTCCCCATTTAAAAACAAATTTAATATCAAAAATACATATTGCAATAGGAATAAGCCATGAGGGAATTGAC 
TTTAATGCTCTTGACAAGAACTTAAGTCATGTTTTTATATTAATACTGTGCCCAGCAAAAGATTACGTTAGCTACC 
CTAGAATTTTAGCATCTGTTGTGGGCAAAGTTGATCTGTACAAAAAAGAAATTTTAAATGCAAAAACAGATAAAGA 
AATTTATAATATAATAGTGAGCTAA 

f477 .aa 

MEKPQGVSIVGAISGAMHVHLMAEHYGVPWLHTDHCAKl^LPWVEGLLEYGEKYYSQHKKPLFSSHM^^ 
KENIEISKKFLERMAKIEMFLEIELGITGGEEDGVDNSDRALHELFSTPEDIYYGYSELLKVSPNFQIAAAFGNVH 
GWKPGNWLTPKVLKIXSQDYVISKTGVNMAKPVSYVFHGGSGSTIDEINEALSYGVVK^ 
YYKKNESRLQGQLGIX3KDIDIPNKKFYDPRVWLREAEVSMKDRVKIACKNLNNINRNZ 

t477 .aa 

MHVHIJyiAEHYGVPWLHTDHCAKNLLPWEGLLEYGEKYYSQHKKPLFSSHMLDLSEEPIKENIEISKKFLERl^ 
lEMFLEIELGITGGEEIXSVDNSDRALHELFSTPEDIYYGYSELLKVSPNFQIAAAFGNVHGVYKPGNVKLTPKVLK 
DGQDYVISKTGVNMAKPVSYVFHGGSGSTIDEINEAIiSYGVVKMNIDTDTQWAAWEGVLNYYK^ 
KDIDIPNKKFYDPRVWLREAEVSMKDRVKIACKNLNNINRNZ 

f477.nt 

ATGGAAAAACCACAAGGAGTTTCAATAGTTGGAGCTATTTCTGGTGCTATGCATGTTCATTTAATGGCA^^ 

ATGGTGTTCCTGTTGTTCTTCATACTGATCACTGTGCTAAAAATTTGCTTCCTT^^ 

TGGAGAGAAATACTATAGTCAGCACAAAAAACCATTATTTTCTTCACATATGTTAGATTTATCAG^ 
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AAAGAAAATATTGAAATTTCTAAAAAATTCTTAGAAAGAATGGCAAAAATTGAAATGTTTTTGGAAATAGAGCTTG 
GAATTACGGGTGGGGAAGAGGATGGAGTTGACAATTCAGATAGAGCTTTGCATGAACTATTTTCTACTCCTGAGGA 
TATTTATTATGGATATTCAGAACTTTTAAAAGTTAGCCCAAATTTTCAGATTGCAGCAGCTTTTGGAAATGTTCAT 
GGGGTATATAAACCGGGGAATGTTAAGCTTACTCCAAAAGTTTTAAAAGATGGTCAAGATTATGTCATATCAAAAA 
CAGGAGTAAATATGGCTAAGCCAGTTTCTTATGTTTTTCATGGAGGGTCTGGATCTACAATTGATGAGATTAATGA 
GGCGCTTTCTTATGGCGTTGTAAAGATGAATATTGACACAGATACACAGTGGGCTGCCTGGGAGGGTGTTTTAAAT 
TATTACAAAAAAAATGAAAGTCGTTTGCAAGGTCAATTAGGAGATGGCAAGGATATTGATATTCCAAATAAGAAAT 
TTTATGATCCAAGGGTTTGGTTAAGAGAAGCTGAAGTTTCTATGAAAGACCGTGTGAAGATTGCATGCAAAAATCT 
TAATAATATTAATAGAAATTAA 

t477.nt 

ATGCATGTTCATTTAATGGCAGAGCATTATGGTGTTCCTGTTGTTCTTCATACTGATCACTGTGCTAAAAATTTGC 
TTCCTTGGGTTGAAGGCCTTTTAGAATATGGAGAGAAATACTATAGTCAGCACAAAAAACCATTATTTTCTTCACA 
TATGTTAGATTTATCAGAAGAACCTATTAAAGAAAATATTGAAATTTCTAAAAAATTCTTAGAAAGAATGGCAAAA 
ATTGAAATGTTTTTGGAAATAGAGCTTGGAATTACGGGTGGGGAAGAGGATGGAGTTGACAATTCAGATAGAGCTT 
TGCATGAACTATTTTCTACTCCTGAGGATATTTATTATGGATATTCAGAACTTTTAAAAGTTAGCCCAAATTTTCA 
GATTGCAGCAGCTTTTGGAAATGTTCATGGGGTATATAAACCGGGGAATGTTAAGCTTACTCCAAAAGTTTTAAAA 
GATGGTCAAGATTATGTCATATCAAAAACAGGAGTAAATATGGCTAAGCCAGTTTCTTATGTTTTTCATGGAGGGT 
CTGGATCTACAATTGATGAGATTAATGAGGCGCTTTCTTATGGCGTTGTAAAGATGAATATTGACACAGATACACA 
GTGGGCTGCCTGGGAGGGTGTTTTAAATTATTACAAAAAAAATGAAAGTCGTTTGCAAGGTCAATTAGGAGATGGC 
AAGGATATTGATATTCCAAATAAGAAATTTTATGATCCAAGGGTTTGGTTAAGAGAAGCTGAAGTTTCTATGAAAG 
ACCGTGTGAAGATTGCATGCAAAAATCTTAATAATATTAATAGAAATTAA 

£488. aa 

MPSSFPFLLVNGSSGIAVGMATlSnyiAPHNLREICDAIVYMLDNENASIFDLLKIVKGPDFPTFGEIVY^ 

TGKGSWIRARYHIEERAEDRNAIIVTEIPYTVNKSALLMKVALLAKEEKLEGLLDIRDESDREGIRIVLEVKRGF 

DPHVIMNLLYEYTEFKKHFSINNLALVNGIPKQLNLEELLFEFIEHRKNIIERRIEFDLRKAKEKAHVLEGLNI 

NNIDEVIKIIKSSKLAKDARERLVSNFGLSEIQANSVLDMRLQKLTALEIFKLEEELNILLSLIKDYEDILLNPVR 

IINIIREETINLGLKFGDERRTKIIYDEEVLKTSMSDLMQKENIWMLTKKGFLKRLSQNEYKLQGTGGKGLSSFD 

LNDGDEIVIALCVNTHDYLFMISNEGKLYIilNAYEIKDSSRASKGQNISELINLGDQEEILTIKNSKDLTDDAYLL 

LTTASGKIARFESTDFKAVKSRGVIVIKLNDKDFVTSAEIVFKDEKVICLSKKGSAFIFNSRDVRLTNRGTQGVCG 

MKLKEGDLFVKVLSVKENPYLLIVSENGYGKRLNMSKISELKRGATGYTSYKKSDKKAGSWDAIAVSEDDEILLV 

SKRSKALRTVAGKVSEQGKDARGIQVLFLDNDSLVSVSKFIKZ 

t488 .aa 

MATNMAPHNLREICDAI\miLDNENASIFDLLKIVKGPDFPTFGEIVYNDNLIKAYKTGKGSWIRARYHIE^ 
DRNAIIVTEIPYTVNKSALLMKVALLAKEEKLEGLLDIRDESDREGIRIVLEVKRGFDPHVIMNLLYEYTEFKKHF 
SINNLALVNGI PKQIiNLEELLFEFIEHRKNI I ERRI EFDLRKAKEKAHVLEGLNIAIJS^ 

RERLVSNFGLSEIQANSVLDMRLQKLTALEIFKLEEELNILLSLIKDYEDILLNPVRIINIIREETINLGLKFGDE 

RRTKIIYDEEVLKTSMSDLMQKENIVVMLTKKGFLKRLSQNEYKLQGTGGKGLSSFDLNIX5DEIVIALCVNTHDYL 

FMISNEGKLYLINAYEIKDSSRASKGQNISELINLX3DQEEILTIKNSKDLTDDAYLLLTTASGKIARFESTDFKAV 

KSRGVIVIKLNDKDFVTSAEIVFKDEKVICLSKKGSAFIFNSRDVRLTNRGTQGVCGMKLKEGDLFVKVLSVKENP 

YLLIVSENGYGKRLNMSKISELKRGATGYTSYKKSDKKAGSVVDAIAVSEDDEILLVSKRSKALRTVAGKVSE 

DARGIQVLFLDNDSLVSVSKFIKZ 

f488.nt 

ATGCCGTCATCATTTCCATTTCTTTTGGTAAATGGCTCTAGTGGAATTGCTGTTGGAATGGCTACTAATATGGCAC 

CTCATAATTTAAGAGAAATTTGTGATGCCATTGTTTACATGCTAGATAATGAGAATGCTTCTATATTTGATTTGCT 

TAAAATAGTTAAAGGGCCTGATTTCCCAACTTTTGGAGAGATTGTTTATAATGATAATTTAATTAAAGCATACAAA 

ACTGGCAAGGGAAGTGTTGTTATTAGGGCAAGATATCATATTGAAGAAAGAGCAGAAGATAGAAATGCTATAAT^ 

TTACAGAAATACCTTATACGGTAAATAAATCTGCACTTCTTATGAAAGTTGCGCTTTTAGCAAAAGAAGAA^ 

AGAAGGACTTTTAGATATAAGAGATGAATCTGATCGAGAAGGTATTAGGATAGTTCTTGAAGTTAAAAGAGGATTT 
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GATCCTCATGTTATTATGAATTTGCTTTATGAATATACTGAATTTAAAAAGCATTTTAGTATAAATAATTTAGCCC 

TTGTTAATGGTATTCCCAAACAGTTAAATTTAGAAGAATTGTTATTTGAATTTATTGAGCATAGAAAAAATATT^ 

CGAAAGACGTATTGAATTTGACTTGAGAAAGGCAAAAGAGAAAGCACATGTTCTTGAGGGATTAAATATTGCTTTA 

AATAATATAGATGAGGTTATTAAGATTATTAAATCATCTAAATTAGCAAAAGATGCAAGGGAGAGGCTTGTTTCGA 

ATTTTGGTCTTTCAGAGATTCAGGCCAATTCAGTTCTTGATATGAGGTTACAAAAACTTACAGCCCTTGAGATTTT 

TAAGCTTGAAGAGGAGCTTAATATACTGTTAAGCTTAATAAAAGATTATGAAGATATTCTCTTGAATCCAGTAAGG 

ATTATTAATATTATAAGAGAAGAAACTATTAATTTAGGTTTGAAATTTGGCGATGAACGTCGAACTAAAATAAT^ 

ATGATGAGGAGGTTTTAAAAACTAGTATGTCGGATTTAATGCAAAAAGAAAATATTGTTGTTATGCTTACAAAGAA 

AGGTTTCCTTAAAAGACTTTCACAAAATGAGTATAAATTGCAAGGTACGGGAGGAAAAGGACTAAGTTCGTTTGA^ 

CTAAATGATGGAGATGAGATTGTTATTGCTTTGTGTGTCAATACTCATGATTATTTATTTATGATTTCAAATGAAG 

GAAAGCTTTATTTAATCAATGCTTATGAAATAAAAGATTCTTCAAGAGCTTCAAAAGGTCAGAATATTAGTGAGCT 

TATTAATTTAGGAGATCAAGAAGAAATATTAACTATTAAGAATAGTAAAGATTTAACTGATGATGCTTATTTATTG 

CTTACAACTGCAAGTGGAAAGATAGCTAGATTCGAATCTACAGATTTTAAAGCAGTAAAGTCACGAGGTGTTATTG 

TTATTAAACTGAATGATAAAGATTTTGTTACAAGTGCAGAGATTGTTTTTAAGGATGAAAAAGTAATTTGTCTTTC 

TAAAAAGGGTAGTGCATTTATATTTAATTCAAGGGATGTTAGGCTTACTAATAGAGGTACCCAAGGTGTTTGTGGA 

ATGAAATTAAAAGAAGGTGATTTGTTTGTTAAAGTTTTATCGGTTAAAGAAAATCCTTATCTTTTGATTGTTTCTG 

AAAATGGGTATGGAAAAAGGTTAAACATGTCTAAAATATCTGAGCTTAAAAGAGGAGCCACTGGTTATACTAGTTA 

TAAAAAATCTGATAAAAAAGCGGGTAGTGTTGTTGATGCTATAGCAGTTTCAGAGGATGATGAAATCTTGCTTGTA 

AGTAAACGTTCAAAAGCTTTAAGAACAGTAGCTGGAAAAGTATCTGAACAAGGCAAAGATGCTAGAGGAATTCAAG 

TATTATTTCTTGATAATGACAGCTTGGTTTCTGTTTCAAAATTTATTAAATAA 

t488.nt 

ATGGCTACTAATATGGCACCTCATAATTTAAGAGAAATTTGTGATGCCATTGTTTACATGCTAGATAATGAGAATG 

CTTCTATATTTGATTTGCTTAAAATAGTTAAAGGGCCTGATTTCCCAACTTTTGGAGAGATTGTTTATAATGATAA 

TTTAATTAAAGCATACAAAACTGGCAAGGGAAGTGTTGTTATTAGGGCAAGATATCATATTGAAGAAAGAGCAGAA 

GATAGAAATGCTATAATTGTTACAGAAATACCTTATACGGTAAATAAATCTGCACTTCTTATGAAAGTTGCGCTTT 

TAGCAAAAGAAGAAAAGCTAGAAGGACTTTTAGATATAAGAGATGAATCTGATCGAGAAGGTATTAGGATAGTTCT 

TGAAGTTAAAAGAGGATTTGATCCTCATGTTATTATGAATTTGCTTTATGAATATACTGAATTTAAAAAGCATTTT 

AGTATAAATAATTTAGCCCTTGTTAATGGTATTCCCAAACAGTTAAATTTAGAAGAATTGTTATTTGAATTTATTG 

AGCATAGAAAAAATATTATCGAAAGACGTATTGAATTTGACTTGAGAAAGGCAAAAGAGAAAGCACATGTTCTTGA 

GGGATTAAATATTGCTTTAAATAATATAGATGAGGTTATTAAGATTATTAAATCATCTAAATTAGCAAAAGATGCA 

AGGGAGAGGCTTGTTTCGAATTTTGGTCTTTCAGAGATTCAGGCCAATTCAGTTCTTGATATGAGGTTACAAAAAC 

TTACAGCCCTTGAGATTTTTAAGCTTGAAGAGGAGCTTAATATACTGTTAAGCTTAATAAAAGATTATGAAGATAT 

TCTCTTGAATCCAGTAAGGATTATTAATATTATAAGAGAAGAAACTATTAATTTAGGTTTGAAATTTGGCGATGAA 

CGTCGAACTAAAATAATTTATGATGAGGAGGTTTTAAAAACTAGTATGTCGGATTTAATGCAAAAAGAAAATATTG 

TTGTTATGCTTACAAAGAAAGGTTTCCTTAAAAGACTTTCACAAAATGAGTATAAATTGCAAGGTACGGGAGG 

AGGACTAAGTTCGTTTGATCTAAATGATGGAGATGAGATTGTTATTGCTTTGTGTGTCAATACTCATGATTATTTA 

TTTATGATTTCAAATGAAGGAAAGCTTTATTTAATCAATGCTTATGAAATAAAAGATTCTTCAAGAGCTTCAAAAG 

GTCAGAATATTAGTGAGCTTATTAATTTAGGAGATCAAGAAGAAATATTAACTATTAAGAATAGTAAAGATTTAAC 

TGATGATGCTTATTTATTGCTTACAACTGCAAGTGGAAAGATAGCTAGATTCGAATCTACAGATTTTAAAGCAGT^ 

AAGTCACGAGGTGTTATTGTTATTAAACTGAATGATAAAGATTTTGTTACAAGTGCAGAGATTGTTTTTAAGGATG 

AAAAAGTAATTTGTCTTTCTAAAAAGGGTAGTGCATTTATATTTAATTCAAGGGATGTTAGGCTTACTAATAGAGG 

TACCCAAGGTGTTTGTGGAATGAAATTAAAAGAAGGTGATTTGTTTGTTAAAGTTTTATCGGTTAAAGAAAATCCT 

TATCTTTTGATTGTTTCTGAAAATGGGTATGGAAAAAGGTTAAACATGTCTAAAATATCTGAG^ 

CCACTGGTTATACTAGTTATAAAAAATCTGATAAAAAAGCGGGTAGTGTTGTTGATGCTATAGCAGTTTCAGAGGA 

TGATGAAATCTTGCTTGTAAGTAAACGTTCAAAAGCTTTAAGAACAGTAGCTGGAAAAGTATCTGAACAAGGCAAA 

GATGCTAGAGGAATTCAAGTATTATTTCTTGATAATGACAGCTTGGTTTCTGTTTCAAAATTTATTAAATAA 

f494.aa 

MFALIRKIFMIYFLCITLAGFAMIFIDSKFTEQPNVKENQSKINQHTIEPNLIMFTSSIGGFLGVYVGIWIFNYDK 
SNFYLNWGNLIILIYNIALIITVYSKSHS 



t494.aa 
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MIFIDSKFTEQPNVKENQSKINQHTIEPNLIMFTSSIGGFLGVWGIWIFNYDKSNFYUSnA^GNLIILITO 
VYSKSHS 

f494.nt 

ATGTTTGCATTAATTAGAAAAATATTTATGATCTATTTTTTATGCATTACTCTTGCAGGTTTTGCCATGATTTTTA 
TTGACAGCAAATTTACCGAACAGCCTAATGTTAAAGAAAATCAAAGCAAAATTAATCAACATAC7UVTTGAACCCAA 
TTTAATCATGTTTACATCTTCTATAGGAGGATTTTTAGGTGTTTATGTTGGAATTTGGATCTTTAACTATGACAAA 
AGCAATTTTTACCTAAATTGGGGAAATTTAATAATATTAATATACAACATAGCCCTAATTATCACTGTATACTCAA 
AATCACATAGTTAG 

t494.iit 

ATGATTTTTATTGACAGCAAATTTACCGAACAGCCTAATGTTAAAGAAAATCAAAGCAAAATTAATCAACATACAA 
TTGAACCCAATTTAATCATGTTTACATCTTCTATAGGAGGATTTTTAGGTGTTTATGTTGGAATTTGGATCTTTAA 
CTATGACAAAAGCAATTTTTACCTAAATTGGGGAAATTTAATAATATTAATATACAACATAGCCCTAATTATCACT 
GTATACTCAAAATCACATAGTTAG 

£516. aa 

MKKTPNTCIFLTLLIISNLNAIoANEEGNTNEKNDQPKQISNFFSPERGFIYSTGIGIGVGFFLNSNIKHLIFRPYY 
TFSNNTFDFLIVAMILTRESLNIPKKMQYFKSYIGGGINWHIANLIKKTKYFSATIGIGGRFYLSTNFIEDIRFYE 
KLPYVIEPYMFIEISSKKAIPLMGLDFKIDFLFLDTFNISFNFTIRYNFKDKNEMET 

t516.aa 

NEEGNTNEKNDQPKQISNFFSPERGFIYSTGIGIGVGFFLNSNIKHLIFRPYYTFSNNTFDFLIVAMILTRESLNI 

PKKMQYFKSYIGGGINWHIANLIKKTKYFSATIGIGGRFYLSTNFIEDIRFYEKLPYVIEPYMFIEISSKKAI^ 

GLDFKIDFLFLDTFNISFNFTIRYNFKDKNEMET 

f516.nt 

ATGAAAAAAACTCCAAACACTTGTATTTTCTTAACATTGCTTATCATTTCCAATTTAAATGCACTTGCAAATGAAG 
AAGGCAATACTAATGAAAAAAATGATCAACCCAAACAAATCTCTAATTTTTTTAGCCCAGAAAGAGGGTTCATATA 
TTCAACAGGAATTGGGATTGGAGTTGGATTTTTTCTAAATTCAAATATTAAACACCTTATCTTTAGACCTTATTAT 
ACATTCTCTAATAATACTTTTGATTTTTTAATCGTTGCTATGATATTAACAAGGGAAAGCCTTAATATCCCCAAAA 
AAATGCAATACTTTAAATCTTATATTGGAGGAGGAATAAACTGGCACATTGCAAACTTAATTAAAAAAACAAAATA 
TTTTTCCGCCACCATTGGCATAGGTGGTCGTTTTTACCTATCTACAAACTTTATAGAAGACATTCGATTTTACGAA 
AAATTGCCTTATGTAATAGAGCCTTATATGTTTATTGAAATTTCTTCTAAAAAGGCAATTCCTTTAATGGGGTTAG 
ACTTTAAAATTGATTTTTTATTTTTAGATACATTTAACATTTCTTTTAATTTTACTATTAGATATAATTTTAAGGA 
CAAAAACGAGATGGAAACATGA 

t516.nt 

aatgaagaaggcaatactaatgaaaaaaatgatcaacccaaacaaatctctaatttttttagcccagaaagagggt 

tcatatattcaacaggaattgggattggagttggattttttctaaattcaaatattaaacaccttatctttagacc 

ttattatacattctctaataatacttttgattttttaatcgttgctatgatattaacaagggaaagcctt;^ 

cccaaaaaaatgcaatactttaaatcttatattggaggaggaataaactggcacattgcaaacttaattaaaaaaa 

caaaatatttttccgccaccattggcataggtggtcgtttttacctatctacaaactttatagaagacattcgatt 

ttacgaaaaattgccttatgtaatagagccttatatgtttattgaaatttcttctaaaaaggcaattcctttaatg 

gggttagactttaaaattgattttttatttttagatacatttaacatttcttttaattttactattagatataatt 

ttaaggacaaaaacgagatggaaacatga 



f517.aa 
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MIPWASGGILIALSIAFVGIGPDGPNFAEHPFYKQIADIGSIAFGMMLPVLAGFIAMAIADKPGLTPGLVGGVMS 
GNVKAGFLGAIFAGFLAGWARFLARRSVPEWLRPVMPIFVIPLISTIIVGFFMLYFGVYIGKFMGVLESGLKSLQ 
SNSETFGVLGKIFLGLVLGSMITVDMGGPFNKVAFLFGVGLIPQVPEIMGMVAAAIPVPPMAMGLATFLAPKLF^ 
EEKESGKIAFLISFIGISEGAIPFAASDPGRVIPSIWGGAVSSIIAAFLGVANHAPHGGPIVLPVIDNKFGFIIA 
lAVGVAVATALVIFLKSLKLKESE 

tSlT.aa 

DKPGLTPGLVGGVMSGWKAGFLGAI FAGFIAGYVARFLARRS VPEWLRPVMP I FVI PL I STI I VGFFMLYFGVYI 
GKFMGVLESGLKSLQSNSETFGVLGKIFLGLVLGSMITVDMGGPFNKVAFLFGVGLIPQVPEIMGMVAAAIPVPPM 
AMGLATFLAPKLFENEEKESGKIAFLISFIGISEGAIPFAASDPGRVIPSIWGGAVSSIIAAFLGVANHAPHGGP 
IVLPVIDNKFGFIIAIAVGVAVATALVIFLKSLKLKESE 



f517 .nt 

ATGATTCCTGTTGTTGCAAGTGGAGGAATTTTAATTGCTCTTAGCATTGCTTTTGTTGGGATTGGACCTGATGGGC 
CTAATTTTGCTGAGCATCCATTTTATAAGCAGATTGCAGATATTGGTTCTATAGCTTTTGGGATGATGTTGCCCGT 
GCTTGCTGGTTTTATTGCAATGGCAATTGCTGATAAGCCTGGTCTTACCCCCGGTCTTGTTGGTGGAGTAATGTCT 
GGGAATGTAAAAGCAGGTTTCTTGGGCGCAATATTTGCGGGCTTTCTTGCAGGTTATGTTGCAAGGTTTTTAGCAA 
GAAGATCTGTTCCTGAGTGGTTAAGACCTGTAATGCCTATATTTGTAATTCCGCTAATAAGCACCATTATTGTCGG 
CTTTTTTATGCTGTATTTTGGTGTTTATATTGGAAAATTTATGGGGGTGCTTGAGAGTGGGCTTAAATCTTTACAG 
AGTAATTCGGAAACTTTTGGCGTGTTGGGTAAAATTTTCTTAGGCTTAGTACTAGGTTCAATGATTACTGTTGATA 
TGGGCGGACCTTTTAATAAAGTGGCATTTCTTTTTGGTGTAGGGCTAATTCCTCAAGTGCCAGAAATAATGGGAAT 
GGTAGCAGCAGCAATTCCTGTTCCTCCTATGGCTATGGGGCTTGCAACCTTTTTAGCACCTAAATTGTTTGAAAAT 
GAAGAAAAAGAATCTGGTAAAATAGCCTTTTTAATTTCATTTATTGGTATTAGCGAAGGAGCTATTCCTTTTGCTG 
CTAGTGATCCCGGACGGGTAATCCCTTCGATAGTGGTAGGGGGAGCTGTATCAAGCATTATTGCCGCTTTTTTAGG 
CGTTGCTAATCATGCTCCACACGGAGGACCAATAGTACTTCCTGTTATTGATAATAAATTTGGGTTTATTATTGCA 
ATTGCTGTTGGAGTTGCGGTTGCAACAGCTTTGGTAATTTTTTTGAAATCTTTAAAATTAAAGGAATCTGAATGA 

t517 .nt 

GATAAGCCTGGTCTTACCCCCGGTCTTGTTGGTGGAGTAATGTCTGGGAATGTAAAAGCAGGTTTCTTGGGCGCAA 
TATTTGCGGGCTTTCTTGCAGGTTATGTTGCAAGGTTTTTAGCAAGAAGATCTGTTCCTGAGTGGTTAAGACCTGT 
AATGCCTATATTTGTAATTCCGCTAATAAGCACCATTATTGTCGGCTTTTTTATGCTGTATTTTGGTGTTTATATT 
GGAAAATTTATGGGGGTGCTTGAGAGTGGGCTTAAATCTTTACAGAGTAATTCGGAAACTTTTGGCGTGTTGGGTA 
AAATTTTCTTAGGCTTAGTACTAGGTTCAATGATTACTGTTGATATGGGCGGACCTTTTAATAAAGTGGCATTTCT 
TTTTGGTGTAGGGCTAATTCCTCAAGTGCCAGAAATAATGGGAATGGTAGCAGCAGCAATTCCTGTTCCTCCTATG 
GCTATGGGGCTTGCAACCTTTTTAGCACCTAAATTGTTTGAAAATGAAGAAAAAGAATCTGGTAAAATAGCCTTTT 
TAATTTCATTTATTGGTATTAGCGAAGGAGCTATTCCTTTTGCTGCTAGTGATCCCGGACGGGTAATCCCTTCGAT 
AGTGGTAGGGGGAGCTGTATCAAGCATTATTGCCGCTTTTTTAGGCGTTGCTAATCATGCTCCACACGGAGGACCA 
ATAGTACTTCCTGTTATTGATAATAAATTTGGGTTTATTATTGCAATTGCTGTTGGAGTTGCGGTTGCAACAGCTT 
TGGTAATTTTTTTGAAATCTTTAAAATTAAAGGAATCTGAATGA 

f519.aa 

MIKIFKKIYILTLVLGMAHLSFASDNYMVRCSKEEDSTTCIAKLKEIKEKKNYDLFSMGIGIGDPIANIMITIPYI 
NIDFGYGGFIGLKSNNFENYLNGGIDVIFKKQIGQYMKIGGGIGIGADWSKTSLIPPNEEEETDYERIGAVIRIPF 
IMEYNFAKNLSIGFKIYPAVGPTILLTKPSILFEGIKFNFFGFGFIKFAFN 

t519.aa 

DNYMVRCSKEEDSTTCIAKLKEIKEKKNYDLFSMGIGIGDPIANIMITIPYINIDFGYGGFIGLKSNNFENYI^ 

IDVIFKKQIGQYMKIGGGIGIGADWSKTSLIPPNEEEETDYERIGAVIRIPFIMEYNFAKNLSIGFKIYPAVGPTI 

LLTKPSILFEGIKFNFFGFGFIKFAFN 
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f519.nt 

ATGATAAAAATTTTTAAAAAAATATACATTTTAACATTAGTATTAGGTATGGCACACCTTTCTTTTGCATCTGACA 
ATTATATGGTCAGATGCAGCAAGGAAGAAGATTCAACCACCTGTATCGCAAAGCTTAAAGAAATAAAAGAAAAGAA 
AAATTATGACTTATTTTCAATGGGCATTGGAATAGGAGATCCTATTGCAAATATTATGATTACAATTCCTTATATA 
AATATTGATTTTGGATATGGAGGTTTTATTGGCCTTAAGTCAAACAATTTTGAAAATTATCTAAATGGTGGAATAG 
ACGTTATTTTTAAAAAGCAAATTGGACAATATATGAAAATTGGCGGCGGCATTGGAATAGGTGCGGATTGGTCAAA 
AACATCCCTTATACCCCCTAATGAAGAAGAAGAAACTGATTATGAGAGAATAGGCGCTGTTATAAGAATTCCTTTT 
ATAATGGAATATAATTTTGCAAAAAATTTATCCATAGGATTCAAAATTTATCCTGCAGTAGGGCCAACAATATTAC 
TAACAAAACCAAGCATTTTATTTGAAGGAATTAAATTCAATTTTTTTGGATTTGGATTCATAAAATTTGCATTTAA 
TTAA 

t519.nt 

GACAATTATATGGTCAGATGCAGCAAGGAAGAAGATTCAACCACCTGTATCGCAAAGCTTAAAGAAATAAAAGAAA 
AGAAAAATTATGACTTATTTTCAATGGGCATTGGAATAGGAGATCCTATTGCAAATATTATGATTACAATTCCTTA 
TATAAATATTGATTTTGGATATGGAGGTTTTATTGGCCTTAAGTCAAACAATTTTGAAAATTATCTAAATGGTGGA 
ATAGACGTTATTTTTAAAAAGCAAATTGGACAATATATGAAAATTGGCGGCGGCATTGGAATAGGTGCGGATTGGT 
CAAAAACATCCCTTATACCCCCTAATGAAGAAGAAGAAACTGATTATGAGAGAATAGGCGCTGTTATAAGAATTCC 
TTTTATAATGGAATATTU^TTTTGCAAAAAATTTATCCATAGGATTCAAAATTTATCCTGCAGTAGGGCCAACAATA 
TTACTAACAAAACCAAGCATTTTATTTGAAGGAATTAAATTCAATTTTTTTGGATTTGGATTCATAAAATTTGCAT 
TTAATTAA 

f520.aa 

MRMLIiATIILILTTGLLAAQSKSKSMTEDDFDFDKLIAKEESVRRLFGIGFGVGYPLANITI 

VGLKPNNFLPYVVMGVDLLFKDEIHKNTMISGGIGIGADWSKGSPEKSNEKLEEEEENEAQQVASLQNRIGWIRL 
PLVIEYSFLKNIVIGFKAVATIGTTMLLGSPMSFEGARFNFLGTGFIKIYI 

t520 .aa 

QSKSKSMTEDDFDFDKLI^EESVRRLFGIGFGVGYPLANITISVPYVDIDLGYGGFVGLKPNNFLPYVVM 

FKDEIHKNTMISGGIGIGADWSKGSPEKSNEKLEEEEENEAQQVASLQNRIGWIRLPLVIEYSFLKNIVIGFKAV 

ATIGTTMLLGSPMSFEGARFNFLGTGFIKIYI 

f520.nt 

ATGAGAATGCTATTAGCAACAATAATACTTATATTAACAACGGGTTTATTAGCTGCACAATCCAAAAGCAAAAGTA 

TGACTGAAGATGACTTTGATTTTGATAAACTTCTTGCAAAAGAAGAGTCTGTGCGCCGTTTATTTGGCATAGGTTT 

TGGAGTTGGATATCCACTTGCAAACATTACAATATCTGTTCCATATGTAGACATAGACCTTGGGTACGGAGGATTC 

GTAGGGCTTAAACCCAACAATTTCTTGCCCTATGTTGTGATGGGTGTAGATCTTCTATTTAAAGATGAAATACA 

AAAACACTATGATTTCTGGAGGCATTGGAATAGGTGCAGATTGGTCAAAAGGAAGTCCTGAAAAATCAAATG^^ 

ACTTGAAGAAGAGGAAGAAAATGAAGCACAACAAGTAGCTTCTCTTCAAAATAGAATAGGGGTTGTGATAAGATTG 

CCTTTGGTAATAGAGTACAGCTTTCTTAAAAATATTGTGATTGGAOTTAAAGCTGTTGCTACTATTGGi^ 

TGCTACTTGGCAGCCCAATGTCATTTGAAGGAGCTAGATTTAATTTCTTAGGCACAGGCTTTATAAAAATATATAT 

ATAG 

t520,nt 

CAATCCAAAAGCAAAAGTATGACTGAAGATGACTTTGATTTTGATAAACTTCTTGCAAAAGAAGAGTCTGTC 
GTTTATTTGGCATAGGTTTTGGAGTTGGATATCCACTTGCAAACATTACAATATCTGTT 

CCTTGGGTACGGAGGATTCGTAGGGCTTAAACCCAACAATTTCTTGCCCTATGTTGTGATGGGTGTAGATCTTCTA 
TTTAAAGATGAAATACACAAAAACACTATGATTTCTGGAGGCATTGGAATAGGTGCAGATTGGTCAAAAGGAAGTC 
CTGAAAAATCAAATGAAAAACTTGAAGAAGAGGAAGAAAATGAAGCACAACAAGTAGCTTCTCT^ 
AGGGGTTGTGATAAGATTGCCTTTGGTAATAGAGTACAGCTTTCTTAAAAATATTGTGAT^ 
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GCTACTATTGGAACAACTATGCTACTTGGCAGCCCAATGTCATTTGAAGGAGCTAGATTTAATTTCTTAGGCACAG 
GCTTTATAAAAATATATATATAG 

f523 .aa 

MNIKINFFFTLPIGIFLGLFFPLGIYSSLSHAFIRLSYLSLIPFLIFSIPLX3IENIIENKNFKKLFGKTIYYGILT 
NLSGVAVSIIAATIYLPQRIPILEKTIQNTCFFEKEALLETFFPKNIFKIFTSSNPNLLSIYMISIIIGTSFYYAK 
QKGRIAREIJyTLSASNLF YHANGF I WI UJIG 1 1 F ITANYAANLKNFKDYPNYTNS I TFFL^ 

RLTKSFKMIYKGIFVSFQNIIFSGLAKDSYSPYVILIEDIKNERINIKKSIIINIPLINFVSKFGTIFVSVISFFI 
ILKSYSSLPISIYEISYMSTLSFVFVFAFPHIPNSLIYIITMLCSTYTKGIELNVSNITPMLPILISLALLIDFAF 
NI AI IHI INFKELKDQEKIN 

t523 .aa 

lENIIENKNFKKLFGKTIYYGILTNLSGVAVSIIAATIYLPQRIPILEKTIQNTCFFEKEALLETFFPKNIFKIFT 
SSNPNLLSIYMISIIIGTSFYYAKQKGRIARELMLSASNLFYHANGFIVNILNIGIIFITANYAANLKNFKDYPNY 
TNSITFFIJ^WTIIILFVILPTISYRLTKSFKMIYKGIFVSFQNIIFSGLAKDSYSPYVILIEDIKNERINIKKSII 
INIPLINFVSKFGTIFVSVISFFIILKSYSSLPISIYEISYMSTLSFVFVFAFPHIPNSLIYIITMLCSTYTKGIE 
LNVSNITPMLPILISLALLIDFAFNIAI IHI INFKELKDQEKIN 

f 523 .nt 

ATGAATATAAAAATCAATTTTTTTTTCACTTTGCCTATTGGAATCTTTTTAGGATTGTTTTTCCCTCTTGGAATTT 
ATAGCTCCTTATCACATGCTTTTATAAGATTATCATACTTATCTCTTATTCCCTTTTTAATATTTTCAATTCCATT 
AGGAATTGAAAATATTATTGAAAATAAAAACTTTAAAAAGCTTTTTGGTAAAACAATTTATTATGGAATTTTAACT 
AACCTATCTGGAGTTGCTGTATCAATAATAGCTGCAACAATATATCTTCCGCAAAGAATTCCAATACTAGAAAAAA 
CAATACAAAATACATGTTTTTTTGAAAAAGAAGCTTTACTAGAAACATTCTTTCCAAAAAATATTTTCAAAATATT 
TACATCTAGCAATCCAAATCTACTAAGCATTTACATGATTTCAATAATAATAGGCACAAGTTTTTATTATGCAAAA 
CAAAAAGGCAGAATAGCTAGAGAACTGATGCTAAGCGCATCCAATCTTTTTTACCATGCAAATGGGTTTATTGTAA 
ACATATTAAATATAGGGATCATTTTTATAACAGCAAATTACGCTGCAAACTTAAAAAACTTCAAAGATTACCCAAA 
TTATACAAACAGCATAACATTCTTTTTGGCATGGACAATTATAATTTTATTCGTAATATTGCCAACAATTAGTTAT 
AGATTAACAAAAAGTTTTAAAATGATATATAAAGGCATTTTTGTATCATTTCAAAACATAATATTTTCAGGACTTG 
CAAAAGATTCTTATTCCCCTTATGTGATATTAATAGAAGATATTAAAAACGAAAGAATAAATATAAAAAAATCCAT 
AATTATAAACATACCTTTAATAAATTTTGTATCTAAATTTGGCACTATTTTTGTTTCAGTAATATCATTTTTTATA 
ATTTTAAAATCATATTCTAGCTTACCCATTTCTATTTATGAAATAAGCTATATGAGCACTTTATCATTTGTTTTTG 
TCTTTGCATTTCCTCATATACCAAATAGTTTAATTTATATAATTACAATGCTTTGCTCTACATATACAAAAGGAAT 
AGAGCTAAATGTTTCAAACATAACACCAATGCTGCCGATATTAATCTCTTTGGCTTTACTAATCGACTTTGCTTTT 
AACATTGCAATCATTCATATAATAAACTTCAAAGAATTAAAAGATCAAGAAAAAATTAATTAA 

£523 .nt 

ATTGAAAATATTATTGAAAATAAAAACTTTAAAAAGCTTTTTGGTAAAACAATTTATTATGGAATTTTAACT 

TATCTGGAGTTGCTGTATCAATAATAGCTGCAACAATATATCTTCCGCAAAGAATTCCAATACTAGAAAAAACAAT 

ACAAAATACATGTTTTTTTGAAAAAGAAGCTTTACTAGAAACATTCTTTCCAAAAAATATTTTCAAAATATTTACA 

TCTAGCAATCCAAATCTACTAAGCATTTACATGATTTCAATAATAATAGGCACAAGTTTTTATTATGCAAAACAAA 

AAGGCAGAATAGCTAGAGT^CTGATGCTAAGCGCATCCAATCTTTTTTACCATGCAAATGGGTTTATTGTAAACAT 

ATTAAATATAGGGATCATTTTTATAACAGCAAATTACGCTGCAAACTTAAAAAACTTCAAAGATTACCCAAATTAT 

ACAAACAGCATAACATTCTTTTTGGCATGGACAATTATAATTTTATTCGTAATATTGCCAACAATTAGTTATAGAT 

TAACAAAAAGTTTTAAAATGATATATAAAGGCATTTTTGTATCATTTCAAAACATAATATTTTCAGGACTTGCAAA 

AGATTCTTATTCCCCTTATGTGATATTAATAGAAGATATTAAAAACGAAAGAATAAATATAAAAAAATCCATAATT 

ATAAACATACCTTTAATAAATTTTGTATCTAAATTTGGCACTATTTTTGTTTCAGTAATATCATTTTTTAT 

TAAAATCATATTCTAGCTTACCCATTTCTATTTATGAAATAAGCTATATGAGCACTTTATCATTTGTTTTTGT^ 

TGCATTTCCTCATATACCAAATAGTTTAATTTATATAATTACAATGCTTTGCTCTACATATACAAAAGGAATAGAG 

CTAAATGTTTCAAACATAACACCAATGCTGCCGATATTAATCTCTTTGGCTTTACTAATCGACTTTGCTTTTAACA 

TTGCAATCATTCATATAATAAACTTCAAAGAATTAAAAGATCAAGAAAAAATTAATTAA 
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f526.aa 

MKKEF IMLLLLLQT IMNLNS I NTNTSTS I VKELQKNL Y I FNSKE YQKDKDTLNE F I NS ININDKE I LQS LE K I KNE 
LFIISVFFNNKKGILIALNLGAEINFKYKISPISISIINNEFEITKILIDYGISLNQIDDTGYSPIFWAIYTNNEK 
IFEFLKESGADLSFTLKNRKTPMQAAIETENIKLIKSLEKKKIYIDDNFKKKLKKLKNKEIVRIIjVK 

t526.aa 

NSINTNTSTSIVKELQKNLYIFNSKEYQKDKDTLNEFINSININDKEILQSLEKIKNELFIISVFFNNKKGILIAL 
NLGAEINFKYKISPISISIINNEFEITKILIDYGISLNQIDDTGYSPIFWAIYTNNEKIFEFLKESGADLSFTLKN 
RKTPMQAAIETENIKLIKSLEKKKIYIDDNFKKKLKKLKNKEIVRILVK 

f526.nt 

ATGAAAAAAGAATTCATTATGCTTTTACTGTTATTGCAAACAATAATGAATTTAAACTCAATAAATACTAATACAA 
GTACTTCAATAGTAAAAGAATTGCAAAAAAATTTATATATTTTCAATAGCAAAGAATATCAAAAAGATAAAGACAC 
TTTAAATGAATTTATAAATTCAATAAATATAAATGACAAAGAAATCTTACAAAGTTTAGAAAAAATCAAAAATGAG 
CTTTTTATAATATCTGTTTTTTTCAACAATAAAAAAGGGATTTTAATTGCACTAAATCTTGGAGCAGAAATAAACT 
TTAAATATAAAATATCTCCAATTTCAATTTCAATAATAAACAATGAATTTGAAATCACAAAAATATTGATAGATTA 
CGGAATAAGCCTTAATCAAATAGATGATACAGGTTATTCTCCAATATTTTGGGCAATATATACTAATAACGAAAAA 
ATATTTGAATTTTTAAAAGAAAGCGGAGCTGATTTAAGTTTCACACTTAAAAATAGAAAAACACCAATGCAAGCCG 
CAATAGAAACAGAAAATATAAAACTAATTAAATCTCTGGAAAAGAAAAAAATTTACATTGACGACAATTTCAAAAA 
AAAACTTAAAAAGCTTAAAAACAAAGAAATAGTTCGAATTTTAGTAAAATAG 

t526 .nt 

AACTCAATAAATACTAATACAAGTACTTCAATAGTAAAAGAATTGCAAAAAAATTTATATATTTTCAATAGCAAAG 
AATATCAAAAAGATAAAGACACTTTAAATGAATTTATAAATTCAATAAATATAAATGACAAAGAAATCTTACAAAG 
TTTAGAAAAAATCAAAAATGAGCTTTTTATAATATCTGTTTTTTTCAACAATAAAAAAGGGATTTTAATTGCACTA 
AATCTTGGAGCAGAAATAAACTTTAAATATAAAATATCTCCAATTTCAATTTCAATAATAAACAATGAATTTGAAA 
TCACAAAAATATTGATAGATTACGGAATAAGCCTTAATCAAATAGATGATACAGGTTATTCTCCAATATTTTGGGC 
AATATATACTAATAACGAAAAAATATTTGAATTTTTAAAAGAAAGCGGAGCTGATTTAAGTTTCACACTTAAAAAT 
AGAAAAACACCAATGCAAGCCGCAATAGAAACAGAAAATATAAAACTAATTAAATCTCTGGAAAAGAAAAAAATTT 
ACATTGACGACAATTTCAAAAAAAAACTTAT^AAAGCTTAAAAACAAAGAAATAGTTCGAATTTTAGTAAAAT^^ 

f544.aa 

MTKNRI IWLLVLMVSSTFTATI ISNYQNLMLSLWLANFI PLLMDTSGNAGSQASALI IRELALGTVK^ 

LKEICVSILVGAILASVNFLRIVFFVAPHHSDKLKIAFWSSCLMVSLTVAKILGGLLPIVAKLLKLDPALMAGPL 

ITTIADAITL.IAYFNIAKWVLVSYAV 

t544.aa 

stftatiisl^qn]jy[lslvvlanfipllmdtsgnagsqasaliireijalgt\^ 

svnflrivffvaphhsdklkiafwssclmvsltvakilggllpivakllkldpalmagplittiadaitliayfn 

lAKWVLVSYAV 
f544.nt 

atgacaaaaaatagaataatttggcttttagttcttatggtgtcttctacttttacagctacaattatttcaaatt 

atcaaaatttaatgttgtctttagtggttttagctaattttattccccttttaatggatacttcaggcaatgccgg 

ctctcaggcatctgcgctaataattcgtgagcttgctcttggtactgtcaaggtaaaagatttttttaaagtgttt 

ttaaaggaaatatgtgttagcattctagtgggagcaattcttgctagtgttaattttttaagaattgtcto 

tagctccacaccattctgataagctgaaaatagcttttgtagtttcatcttgcttgatggtaagtt^ 

aaagatattgggaggtcttttacccattgttgctaaacttttaaagttggatccagcacttatggca 
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ATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGCTG 
TTTAA 

t544.nt 

TCTACTTTTACAGCTACAATTATTTCAAATTATCAAAATTTAATGTTGTCTTTAGTGGTTTTAGCTAATTTTATTC 
CCCTTTTAATGGATACTTCAGGCAATGCCGGCTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTAC 
TGTCAAGGTAAAAGATTTTTTTAAAGTGTTTTTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCT 
AGTGTTAATTTTTTAAGAATTGTCTTTTTTGTAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTT 
CATCTTGCTTGATGGTAAGTTTGACAGTAGCAAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAA 
GTTGGATCCAGCACTTATGGCAGGCCCTTTAATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAAT 
ATAGCAAAATGGGTTTTAGTTAGCTATGCTGTTTAA 

f545.aa 

MTKNRI IWLLVLMVSSTFTATI I SNYQNLMLSLWLANFI PLLMDTSGNAGSQASALI IRELALGTVKVKDFFKVF 
LKEICVSILVGAILASVNFLRIVFFVAPHHSDKLKIAFWSSCLMVSLTVAKILGGLLPIVAKLLKLDPALMAGPL 
ITTIADAITLIAYFNIAKWVLVSYAV 

t545.aa 

GSQASALIIREI^LGTVKVKDFFKVFLKEICVSILVGAILASVNFLRIVFFVAPHHSDKLKIAFVVSSCLrMVSLTV 
AKILGGLLPIVAKLLKLDPALMAGPLITTIADAITLIAYFNIAKWVLVSYAV 

f545.nt 

ATGACAAAAAATAGAATAATTTGGCTTTTAGTTCTTATGGTGTCTTCTACTTTTACAGCTACAATTATTTCAAATT 
ATCAAAATTTAATGTTGTCTTTAGTGGTTTTAGCTAATTTTATTCCCCTTTTAATGGATACTTCAGGCAATGCCGG 
CTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTACTGTCAAGGTAAAAGATTTTTTTAAAGTGTTT 
TTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTGTTAATTTTTTAAGAATTGTCTTTTTTG 
TAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTTCATCTTGCTTGATGGTAAGTTTGACAGTAGC 
AAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAAGTTGGATCCAGCACTTATGGCAGGCCCTTTA 
ATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGCTG 
TTTAA 

t545.rLt 

GGCTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTACTGTCAAGGTAAAAGATTTTTTTAAAGTGT 
TTTTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTGTTAATTTTTTAAGAATTGTCTTTTT 
TGTAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTTCATCTTGCTTGATGGTAAGTTTGACAGTA 
GCAAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAAGTTGGATCCAGCACTTATGGCAGGCCCTT 
TAATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGC 
TGTTTAA 

f577.aa 

MRIKNLILIAILLISPSCSTNKNIVVLTDNKTIPFYINQFNIENKANFIIKFRNNIDLQTIEKE^ 

TNIANHFKSVKIlSrraPDYPILKHIFKQFlSrYKIIPLGFDIPILIYKNTHHIKKYINTKYLKEEYENFIKDGKFFIS 

YVSENLFYVISQINNWFSFEKNKLNYNENQILKMLEYFSSFLNTKQMDLQKDFFNKYGYLKLN 

GLSDITFYNSLSEQEKSQIKFSYLINDlSnsrEIVISNPNFIGILETSVLTKKFINWILYKKTQKTLIGFNNQSQSNIC 

FGFANGFTPYKELNLKIKHSIDGISPFIIDETQINSHSYVLSKKTIEKENLLINEWFFSKANNLKKNK^ 

t577.aa 

NKNIVVLTDNKTI PFYINQFNIENKANFI IKFRISnsriDLQTIEKENAQI 1 1 SKNIGNTNIANHFKSV^ 
LKHIFKQFNYKIIPI^FDIPILIYKNTHHIKKi^IOTKYLKEEYENFIKIXSKFFISPYVSENLFW 
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EKNKLNYNENQ I LKMLEYFS S F LNTKQMDLQKDFFNKYG YliKIJSFK I LIJ^ 

FSYLINDNNEIVISNPNFIGILETSVLTKKFINWILYKKTQKTLIGFNNQSQSNICFGFANGFTPYKELN^^ 
IDGISPFIIDETQINSHSYVLSKKTIEKENLLINEWFFSKANNLKKNKN 

£577 .nt 

ATGAGAATAAAAAATTTAATACTAATAGCAATTTTATTAATTAGCCCTAGCTGTTCAACAAATAAGAACATCGTTG 

TACTAACTGACAATAAAACAATACCATTTTATATAAATCAATTTAATATAGAAAATAAAGCAAATTTTATAATTAA 

GTTTAGAAATAATATTGATCTGCAAACAATAGAAAAAGAAAATGCACAAATAATTATTTCTAAA^ 

ACAAATATTGCTAACCATTTTAAATCTGTAAAAATCAATTATAATCCAGATTATCCTATCTTAAAGCATATTTTCA 

AGCAATTTAACTACAAAATTATTCCATTGGGCTTTGACATTCCTATTTTAATCTATAAAAATACACATCATATTAA 

AAAATACATAAACACTAAATATCTAAAAGAAGAATACGAAAATTTCATTAAAGATGGAAAATTTTTTATATCGCCT 

TATGTTTCTGAAAATTTATTTTATGTGATTTCTCAAATAAATAATGTGAGATTTTCTTTTGAAAAAAATAAATTAA 

ATTATAATGAGAATCAAATTTTAAAAATGCTAGAATATTTCTCATCATTTTTAAATACAAAACAAATGGACTTGCA 

AAAAGATTTCTTTAATAAATACGGCTACCTAAAGTTAAATAAAATATTGCTTAATAAAAAATCTCTTTTAATAGCA 

GGATTGAGCGATATAACCTTCTACAATAGCTTAAGCGAACAAGAGAAGTCACAAATAAAATTTTCCTATTTAATAA 

ACGATAACAATGAAATTGTTATCTCAAACCCAAATTTTATTGGCATTTTAGAAACATCTGTTTTAACTAAAAAATO 

TATCAACTGGATATTGTATTU^AAAAACTCAAAAAACCCTAATTGGATTTAACAATCAATCCCAATCAAATATATGT 

TTTGGATTTGCCAATGGTTTTACCCCTTACAAAGAATTAAATTTAAAAATAAAACATTCAATTGATGGAATATCTC 

CTTTTATTATTGACGAAACTCAAATCAATAGCCATTCCTATGTATTAAGCAAAAAAACAATTGAAAAAGAAAACTT 

ACTAATAAATGAATGGTTTTTCTCTAAAGCTAATAATCTAAAAAAAAATAAAAATTAA 

t577 .nt 

AATAAGAACATCGTTGTACTAACTGACAATAAAACAATACCATTTTATATAAATCAATTTAATATAGAAAATAAAG 
CAAATTTTATAATTAAGTTTAGAAATAATATTGATCTGCAAACAATAGAAAAAGAAAATGCACAAATAATTATTTC 
TAAAAACATTGGTAACACAAATATTGCTAACCATTTTAAATCTGTAAAAATCAATTATAATCCAGATTATCCTATC 
TTAAAGCATATTTTCAAGCAATTTAACTACAAAATTATTCCATTGGGCTTTGACATTCCTATTTTAATCTATAAAA 
ATACACATCATATTAAAAAATACATAAACACTAAATATCTAAAAGAAGAATACGAAAATTTCATTAAAGATGGAAA 
ATTTTTTATATCGCCTTATGTTTCTGAAAATTTATTTTATGTGATTTCTCAAATAAATAATGTGAGATTTTCTTTT 
GAAAAAAATAAATTAAATTATAATGAGAATCAAATTTTAAAAATGCTAGAATATTTCTCATCATTTTTAAATACAA 
AACAAATGGACTTGCAAAAAGATTTCTTTAATAAATACGGCTACCTAAAGTTAAATAAAATATTGCTTAATAAAAA 
ATCTCTTTTAATAGCAGGATTGAGCGATATAACCTTCTACAATAGCTTAAGCGAACAAGAGAAGTCACAAATAAAA 
TTTTCCTATTTAATAAACGATAACAATGAAATTGTTATCTCAAACCCAAATTTTATTGGCATTTTAGAAACATCTG 
TTTTAACTAAAAAATTTATCAACTGGATATTGTATAAAAAAACTCAAAAAACCCTAATTGGATTTAACAATCAATC 
CCAATCAAATATATGTTTTGGATTTGCCAATGGTTTTACCCCTTACAAAGAATTAAATTTAAAAATAAAACATTCA 
ATTGATGGAATATCTCCTTTTATTATTGACGAAACTCAAATCAATAGCCATTCCTATGTATTAAGCAAAAAAACAA 
TTGAAAAAGAAAACTTACTAATAAATGAATGGTTTTTCTCTAAAGCTAATAATCTAAAAAAAAATAAAAA 

f584.aa 

MIKTILLLVLYPVWFSQISANQYFEGIYAKYQNIEDMQATINFTLKGLKQTGVLLYKFPDKFIINLDSNNQVFVS 

DGEFLTVYVPSLiGTSFNQQLLKGSSGGGLMKVLNSEYSVSYTNSPNLEDLDSSEPGKYIKLTFSRKLYKGA^ 

FIIAFAPDGIIRRITAFPTSGGREIVIDLTAVKFNVGILDSKFKYDPPKSSNKVDNFLYDIKKN 



t584 .aa 

QISANQYFEGIYAKYQNIEDMQATINFTLKGLKQTGVLLYKFPDKFIINLDSNNQVFVSDGEFLTVYVPSLGTSFN 
QQLLKGSSGGGLMKVLNSEYSVSYTNSPNLEDLDSSEPGKYIKLTFSRKLYKGAATINSFIIAFAPDGIIRRITAF 
PTSGGREIVIDLTAVKFNVGILDSKFKYDPPKSSNKVDNFLYDIKKN 

f584.nt 

ATGATAAAAACAATACTTTTATTAGTTTTGTATCCTGTTGTTGTGTTTTC 

AAGGAATTTATGCTAAATATCAAAATATAGAGGACATGCAAGCAACAATTAATTTTACTTTAAAGGGG 
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AACAGGTGTTTTGCTTTATAAGTTTCCAGACAAGTTTATTATCAATTTAGATTCAAATAATCAAGTTTTTG^ 

GATGGTGAATTTTTGACAGTTTATGTTCCATCTCTTGGGACTTCTTTTAATCAGCAATTATTAAAGGGTAGTAGTG 

GGGGAGGTCTTATGAAAGTTTTAAATAGTGAGTATAGCGTATCTTATACCAATTCTCCAAATTTAGAAGATC 

TTCATCTGAGCCTGGAAAATATATTAAATTAACCTTTTCTAGAAAGCTTTACAAGGGGGCTGCTACTATTAATTCT 

TTTATTATTGCTTTTGCTCCGGATGGAATAATTAGAAGAATTACTGCTTTTCCTACTAGTGGTGGGCGCGAAATAG 

TTATTGATTTGACTGCTGTGAAGTTTAATGTTGGAATTCTTGATAGCAAATTTAAATATGATCCTCCAAAATCTTC 

AAATAAGGTAGATAATTTTTTATATGATATTAAAAAAAATTAA 

t584.nt 

CAAATATCTGCAAATCAATATTTTGAAGGAATTTATGCTAAATATCAAAATATAGAGGACATGCAAGCAACAATTA 
ATTTTACTTTAAAGGGGTTAAAGCAAACAGGTGTTTTGCTTTATAAGTTTCCAGACAAGTTTATTATCAATTTAGA 
TTCAAATAATCAAGTTTTTGTAAGTGATGGTGAATTTTTGACAGTTTATGTTCCATCTCTTGGGACTTCTTTTAAT 
CAGCAATTATTAAAGGGTAGTAGTGGGGGAGGTCTTATGAAAGTTTTAAATAGTGAGTATAGCGTATCTTATACCA 
ATTCTCCAAATTTAGAAGATCTCGATTCATCTGAGCCTGGAAAATATATTAAATTAACCTTTTCTAGAAAGCTTTA 
CAAGGGGGCTGCTACTATTAATTCTTTTATTATTGCTTTTGCTCCGGATGGAATAATTAGAAGAATTACTGCTTTT 
CCTACTAGTGGTGGGCGCGAAATAGTTATTGATTTGACTGCTGTGAAGTTTAATGTTGGAATTCTTGATAGCAAAT 
TTAAATATGATCCTCCAAAATCTTCAAATAAGGTAGATAATTTTTTATATGATATTAAAAAAAATTAA 

f596.aa 

MKERCLYLLVFVALCVIWLFSDDYLIYDFDLSLNEFLEVSTRKDNLEPMVDSNRILLFYPPKKEIRKIFAAFDFDQ 
YSKKYLFKKNEHGVFFVKWIPHGTSSIKYRLIVDGWTNDEYNKNVVYNEDLIPFSKIEIAKEKSSYISLRNPIQ 
SYDNNEIEIFYIGRPGQIVTIAGSFNNFNPFLNRLIEKEDNKGIYTIKLKNLPKDRIYYYFIDSGNKVIDKNNVNR 
INLYFVEGIDNKIDFEVSYFDHK 

t596 .aa 

DDYLIYDFDLSLNEFLEVSTRKDNLEPMVDSNRILLFYPPKKEIRKIFAAFDFDQYSKKYLFKKNEHGVFFVKVNI 

PHGTSSIKYRLIVIDGWTNDEYNKIWVYNEDLIPFSKIEIAKEKSSYISLRNPIQSYDlSnSrEIEIFYI 

AGSFNNFNPFLNRLIEKEDNKGIYTIKLKNLPKDRIYYYFIDSGNKVIDKNNVNRINLYFVEGIDNKID^ 

HK 

f596.nt 

ATGAAAGAAAGGTGTTTGTATTTATTGGTTTTTGTAGCTTTATGTGTTAACAATCTTTTTTCAGATGATTATTTAA 

TTTATGACTTTGATTTAAGTTTAAATGAATTTCTAGAAGTTTCAACAAGAAAAGACAATCTTGAGCCTATGGTTGA 

TTCCAATCGTATATTATTGTTTTATCCTCCTAAAAAAGA/^TTAGAAAAATTTTTGCTGCCTTTGACTTTGATCAG 

TATTCTAAGAAATATTTATTCAAAAAAAATGAGCATGGAGTTTTTTTTGTTAAAGTTAATATTCCTCATGGCACAA 

GCAGTATAAAATATAGGCTTATTGTAGACGGTGTTTGGACTAATGACGAGTATAATAAAAATGTAGTTTATAATGA 

GGATTTAATCCCATTTTCTAAAATTGAGATCGCTAAAGAGAAGTCCAGCTATATTTCTTTGAGAAATCCAATACAA 

TCATATGATAACAATGAAATTGAAATTTTTTACATAGGTCGTCCTGGACAAATAGTTACAATAGCTC^ 

ACAATTTTAATCCTTTTTTAAATAGGCTTATTGAGAAAGAGGACAATAAGGGAATTTATACTATTAAGCTTAAAAA 

TTTACCCAAGGATAGAATTTATTATTATTTTATTGATTCTGGTAACAAAGTAATAGATAAAAATAATC 

ATTAATTTATATTTTGTTGAGGGAATTGATAATAAAATAGATTTCGAAGTTTCCTATTTTGATCATAAGTAA 

t596.nt 

GATGATTATTTAATTTATGACTTTGATTTAAGTTTAAATGAATTTCTAGAAGTTTCAACAAGAA^ 

AGCCTATGGTTGATTCCAATCGTATATTATTGTTTTATCCTCCTAAAAAAGAAATTAGAAAAATTTT^ 

TGACTTTGATCAGTATTCTAAGAAATATTTATTCAAAAAAAATGAGCATGGAGTO 

CCTCATGGCACAAGCAGTATAAAATATAGGCTTATTGTAGACGGTGTTTGGACTAATGACGAGTATAATAAA^ 

TAGTTTATAATGAGGATTTAATCCCATTTTCTAAAATTGAGATCGCTAAAGAGAAGTCCAGCTATATTTCTTTGAG 

AAATCCAATACAATCATATGATAACAATGAAATTGAAATTTTTTACATAGGTCGTCCTGGACAAATAGT 

GCTGGTAGTTTTAACAATTTTAATCCTTTTTTAAATAGGCTTATTGAGAAAGAGGACAATAAGG^ 

TTAAGCTTAAAAATTTACCCAAGGATAGAATTTATTATTATTTTATTGATTCTGG 
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TAATGTTAATAGAATTAATTTATATTTTGTTGAGGGAATTGATAATAAAATAGATTTCGAAGTTTCCTATTTTGAT 
CATAAGTAA 



£598. aa 

MRQRVMIAMALSCHPSLLIADEPTTALDVTIQEQILLLIKNLSKKFNTSTIFITHDIAWAEICDTVSVMYQ 
EEGTVEEIFNNPKHPYTIGLLKSILTLEHDPNKKLYSTKENPMKITKTSTEEF 

t598 .aa 

EPTTALDWIQEQILLLIKNLSKKFNTSTIFITHDLAWAEICDTVSVMYQGKIVEEGTVEEIFNNPKHPYTIGLL 
KSILTLEHDPNKKLYSTKENPMKITKTSTEEF 

f598.nt 

ATGAGACAAAGAGTTATGATTGCCATGGCTCTTAGCTGTCATCCATCCTTATTAATAGCAGATGAACCAACAACAG 
CCCTTGATGTTACAATCCAAGAGCAAATATTATTATTAATCAAAAACCTATCTAAAAAATTCAATACTTCTACCAT 
ATTTATAACTCATGATCTTGCGGTTGTTGCTGAAATTTGTGATACAGTATCTGTAATGTATCAAGGAAAAATTGTA 
GAAGAAGGAACAGTAGAGGAAATATTTAACAATCCTAAGCATCCTTACACCATTGGGCTTTTAAAATCAATTCTTA 
CGCTAGAACACGATCCAAATAAAAAGCTTTATTCAACAAAAGAAAACCCTATGAAGATCACAAAAACCAGCACCGA 
GGAGTTTTAA 

t598.nt 

GAACCAACAACAGCCCTTGATGTTACAATCCAAGAGCAAATATTATTATTAATCAAAAACCTATCTAAAAAATTCA 
ATACTTCTACCATATTTATAACTCATGATCTTGCGGTTGTTGCTGAAATTTGTGATACAGTATCTGTAATGTATCA 
AGGAAAAATTGTAGAAGAAGGAACAGTAGAGGAAATATTTAACAATCCTAAGCATCCTTACACCATTGGGCTTTTA 
AAATCAATTCTTACGCTAGAACACGATCCAAATAAAAAGCTTTATTCAACAAAAGAAAACCCTATGAAGATCACAA 
AAACCAGCACCGAGGAGTTTTAA 

£600 .aa 

MAIMERSIIGLFIALAFVSWLTVARWRGQVQSLSSSEFIQAAKTLGATNQRIILKHLIPNSIGMIVIFTTIRVPS 
FIMAEAFLSFLGLGISAPMTSWGELVQNGIATFVEYPWKVFIPAIVMTIFLLFMNFLGDGLRDAFDPKDSI 



t600.aa 

RWRGQVQSLSSSEFIQAAKTLGATNQRIILKHLIPNSIGMIVIFTTIRVPSFIMAEAFLSFLGLGISAPMTSWGE 
LVQNGI ATFVEYPWKVF I PAI VMTI FLLFMNFLGDGLRDAFDPKDS I 

£600 .nt 

ATGGCAATAATGGAAAGAAGTATAATCGGCTTATTCATAGCACTTGCATTTGTATCATGGTTAACAGTAGCTCGAG 

TTGTACGAGGCCAAGTACAATCACTATCAAGTTCGGAATTTATACAAGCAGCCAAAACCCTTGGTGCAACAAATCA 

AAGAATAATCTTAAAACACTTGATCCCTAATAGCATTGGAATGATAGTTATATTCACAACAATAAGGGTTCCAAGC 

TTTATTATGGCTGAAGCATTTTTATCCTTTTTAGGACTTGGAATTTCAGCTCCAATGACAAGCTO 

TGCAAAATGGAATTGCTACATTTGTTGAATATCCATGGAAAGTTTTTATTCCAGCTATAGTTATGACAATATTTCT 

ATTATTTATGAACTTTTTAGGTGATGGGCTAAGGGATGCTTTTGATCCAAAAGATAGCATCTAA 

t600.nt 

CGAGTTGTACGAGGCCAAGTACAATCACTATCAAGTTCGGAATTTATACAAGCAGCCAAAACCCTTGGTG 

ATCAAAGAATAATCTTAAT^CACTTGATCCCTAATAGCATTGGAATGATAGTTATATTCACAACAATAAGGGTTCC 

AAGCTTTATTATGGCTGAAGCATTTTTATCCTTTTTAGGACTTGGAATTTCA^ 
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TTAGTGCAAAATGGAATTGCTACATTTGTTGAATATCCATGGAAAGTTTTTATTCCAGCTATAGTTATGACAATAT 
TTCTATTATTTATGAACTTTTTAGGTGATGGGCTAAGGGATGCTTTTGATCCAAAAGATAGCATCTAA 

f603 .aa 

MLKFTLKKILGIIPTLLVIIFLCFFVMRMAPGSPFDSEKPIDPQVKARLMEKYHLDKPFYIQAFYYITNALRGDI^ 
PSLKKKDLTVSQYIKLGFPKSLTIiGVISLIISLSIGIPIGILAAIYKNTYVDYIITSIAILGISIPLFVIGPILQY 
FFAIKWGLLYTSGWITERGGFSNLILPIITLSMPNVAIFARIIRGSMLEIIQSDFIRTARAKGLSFKKIVIKHMLR 
GAMLPWSYIGPAFAAIISGSWIEKIFRIAGMGMFITESALNRDYPVLMGGLLVYSIILLISILISDIIYKILDP 
RV 

t603 .aa 

SPFDSEKPIDPQVKARLMEKYHLDKPFYIQAFYYITNALRGDLGPSLKKKDLTVSQYIKLGFPKSLTLGVISLIIS 
LSIGIPIGIIAAIYKNTYVDYIITSIAILGISIPLFVIGPILQYFFAIKWGLLYTSGWITERGGFSNLILPIITLS 
MPNVAIFARIIRGSMLEIIQSDFIRTARAKGLSFKKIVIKHMLRGAMLPWSYIGPAFAAIISGSWIEKIFRIAG 
MGMFITESALNRDYPVLMGGLLVYS I I LLI SI LI SDI I YKILDPRV 

f603.nt 

ATGTTAAAGTTTACTTTAAAGAAAATATTAGGAATAATACCAACTTTACTGGTAATAATTTTTTTATGCTTTTTTG 
TAATGAGAATGGCTCCTGGAAGTCCATTTGATTCTGAAAAACCTATTGATCCTCAAGTAAAAGCAAGATTGATGGA 
AAAATATCACCTTGACAAGCCTTTTTATATTCAAGCTTTTTATTACATTACAAACGCTCTCAGGGGAGATCTGGGA 
CCTTCTTTGAAAAAGAAAGACCTTACAGTTAGTCAATACATAAAATTAGGATTTCCAAAATCACTTACACTAGGAG 
TAATATCCCTTATTATATCACTATCAATAGGAATACCAATAGGTATATTAGCTGCCATTTATAAAAATACTTATGT 
GGATTATATAATAACATCAATAGCAATATTGGGGATTTCAATACCATTATTCGTAATAGGGCCAATTTTACAATAT 
TTTTTTGCAATTAAATGGGGTTTGCTTTATACCTCTGGATGGATTACAGAAAGAGGAGGATTTTCAAATTTAATTC 
TACCCATAATAACTCTTAGCATGCCCAACGTAGCTATTTTCGCAAGAATAATCAGAGGATCAATGCTAGAAATAAT 
ACAAAGCGACTTTATAAGAACTGCGCGTGCAAAAGGGCTAAGCTTCAAAAAGATAGTTATAAAGCATATGTTAAGA 
GGAGCAATGTTGCCTGTAGTAAGCTATATAGGTCCAGCATTTGCTGCTATAATATCTGGAAGCGTGGTTATTGAAA 
AAATATTTAGAATTGCTCGAATGGGAATGTTTATAACAGAATCCGCACTAAACAGAGATTACCCAGTATTAATGGG 
CGGATTGTTAGTATATTCAATAATACTGCTTATTTCTATATTAATATCAGATATTATATATAAAATATTAGATCCA 
AGAGTATAA 

t603 ,nt 

AGTCCATTTGATTCTGAAAAACCTATTGATCCTCAAGTAAAAGCAAGATTGATGGAAAAATATCACCTTGACAAGC 
CTTTTTATATTCAAGCTTTTTATTACATTACAAACGCTCTCAGGGGAGATCTGGGACCTTCTTTGAAAAAGAAAGA 
CCTTACAGTTAGTCAATACATAAAATTAGGATTTCCAAAATCACTTACACTAGGAGTAATATCCCTTATTATATCA 
CTATCAATAGGAATACCAATAGGTATATTAGCTGCCATTTATAAAAATACTTATGTGGATTATATAATAACATCAA 
TAGCAATATTGGGGATTTCAATACCATTATTCGTAATAGGGCCAATTTTACAATATTTTTTTGCAATTAAATGGGG 
TTTGCTTTATACCTCTGGATGGATTACAGAAAGAGGAGGATTTTCAAATTTAATTCTACCCATAATAACTCTTAGC 
ATGCCCAACGTAGCTATTTTCGCAAGAATAATCAGAGGATCAATGCTAGAAATAATACAAAGCGACTTTATAAGAA 
CTGCGCGTGCAAAAGGGCTAAGCTTCAAAAAGATAGTTATAAAGCATATGTTAAGAGGAGCAATGTTGCCTGTAGT 
AAGCTATATAGGTCCAGCATTTGCTGCTATAATATCTGGAAGCGTGGTTATTGAAAAAATATTTAGAATTGCTGGA 
ATGGGAATGTTTATAACAGAATCCGCACTAAACAGAGATTACCCAGTATTAATGGGCGGATTGTTAGTATATTCAA 
TAATACTGCTTATTTCTATATTAATATCAGATATTATATATAAAATATTAGATCCAAGAGTATAA 

f 607 .aa 

MKYIKIALMLIIFSLIACISNAKKEKIVFRVSNLSEPSSLDPQLSTDLYGSNIITNLFLGLAVKDSQTGKYKPGLA 

KSWNISErX3IIYTFNLREDIVWSIX5VAITAEEIKKSYLRIIiNKKTAAl^ANLIKSTIKNAQEYFDETVP 

AIDSKTLEITLTSPKPYFPDMLTHSAYIPVPMHIV^KYGENWTNPENIWSGAYKLKERSINDKIVIEKNEKYYNA 

KNVEIDEVIFYPTEGSVAYNMYINGELDFIiQGAEKNNLEEIKIRDDYYSGLKNGMAYIAFNTTIKPLDN 

SLAIDRETLTKVVLKGSSDPTRNLTPKFDDYSYGKNLILFDPENAKiCLIJVEAGYPI^ 
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FLQEQFKKILNINLEIENEEWTTFLGSRRTGNYQMSSVGWIGDYFDPLTFLDSLFTTENHFLGAYKYSNKEYDALI 
KKSNF ELDP I KRQDI LRQAEE 1 1 AEKDF PMAPL YI PKSHYLFRNDKWTGWVPNI AE S YLYEDI KTKK 

t607 .aa 

CISNAKKEKIVFRVSNLSEPSSLDPQLSTDLYGSNIITNLFLGLAVKDSQTGKYKPGLAKSWNISEDGIIYTFNLR 
EDIVWSTCVAITAEEIKKSYLRIIJSrKKTAAMYANLIKSTIKNAQEYFDEWPESEUSIKAID 

FPDMLTHSAYIPVPMHIVEKYGENWTNPENIVVSGAYKLKERSINDKIVIEKNEKYYNAKNVEIDEVIFYPTEGSV 
AYNMYINGELDFLQGAEKNNLEEIKIRDDYYSGLKNGMAYIAFNTTIKPLDNLKVRQAISLA^ 

SDPTRNIiTPKFDDYSYGKNLILFDPENAKKLLAEAGYPDGKGFPTLKYKISEGRPTTAEFLQEQFKKILNINLEIE 
NEEWTTFLGSRRTGNYQMSSVGWIGDYFDPLTFLDSLFTTENHFLGAYKYSNKEYDALIKKSNFELDPIKRQDILR 
QAEEI I AEKDFPMAPLYI PKSHYLFRNDKWTGWVPNIAESYLYEDIKTKK 

f 607 .nt 

ATGAAATATATAAAAATAGCCTTAATGCTAATAATTTTTTCTTTAATAGCATGTATTAGTAATGCTAAAAAAGAAA 
AAATAGTTTTCAGAGTATCAAACTTAAGCGAGCCATCATCACTTGATCCTCAACTCTCAACAGACCTTTACGGTAG 
CAACATTATTACAAACCTATTCTTAGGCCTAGCGGTAAAAGATTCTCAAACTGGAAAATATAAACCAGGACTTGCA 
AAAAGTTGGAATATTTCTGAAGATGGAATTATTTACACATTTAACCTAAGAGAAGATATAGTTTGGAGCGATGGAG 
TTGCCATTACTGCCGAGGAGATAAAAAAATCATACCTAAGAATTTTAAATAAAAAAACAGCTGCAATGTATGCTAA 
TTTAATAAAATCTACAATAAAAAATGCACAAGAATATTTCGATGAGACAGTGCCTGAATCTGAGCTTGGCATAAAG 
GCTATTGACAGCAAAACCTTAGAGATAACATTAACATCTCCAAAGCCTTATTTTCCTGATATGCTAACACACTCAG 
CATACATACCAGTTCCAATGCATATTGTTGAAAAATATGGAGAAAATTGGACAAATCCTGAAAATATAGTTGTTAG 
TGGCGCATACAAACTTAAAGAAAGATCAATTAACGATAAAATCGTAATAGAAAAAAATGAAAAATACTATAATGCA 
AAAAATGTAGAAATTGATGAAGTAATATTTTACCCAACAGAAGGTAGCGTGGCTTACAATATGTACATAAACGGTG 
AACTCGATTTTCTACAAGGAGCAGAAAAGAATAATTTAGAAGAAATTAAAATAAGAGATGATTATTATTCTGGGTT 
AAAAAACGGAATGGCATACATAGCATTCAATACAACAATAAAACCACTAGACAATTTAAAAGTTAGACAAGCCATC 
TCCCTTGCCATTGACAGAGAAACTTTAACTAAAGTAGTTTTAAAGGGAAGTTCAGATCCAACAAGAAATCTAACTC 
CAAAATTTGATGATTATTCTTATGGAAT^AAATTTAATACTATTTGATCCTGAGAATGCAAAAAAACTTTTAGCTGA 
AGCTGGATATCCGGATGGGAAAGGATTCCCCACATTAAAATATA7VAATATCGGAGGGAAGACCAACAACAGCAGAA 
TTTTTGCAAGAACAATTTAAAAAAATACTAAACATTAACTTAGAAATCGAGAATGAAGAATGGACAACATTCCTAG 
GAAGCAGAAGAACTGGAAATTACCAAATGTCAAGCGTGGGGTGGATAGGAGATTATTTTGATCCCTTAACATTCTT 
AGACAGCTTATTTACAACAGAAAATCATTTTTTAGGAGCGTACAAATATTCAAACAAAGAGTATGATGCTTTAATA 
AAAAAATCTAATTTTGAACTTGATCCAATAAAAAGACAAGACATTTTAAGACAAGCTGAAGAGATAATAGCAGAAA 
AAGACTTTCCTATGGCACCTTTATATATACCCAAATCTCATTATCTTTTCAGAAATGATAAATGGACAGGGTGGGT 
ACCAAATATCGCAGAAAGCTATTTATATGAAGATATTAAAACTAAAAAATAA 

t607 .nt 

TGTATTAGTAATGCTAAAAAAGAAAAAATAGTTTTCAGAGTATCAAACTTAAGCGAGCCATCATCACTTGATCC 

AACTCTCAACAGACCTTTACGGTAGCAACATTATTACAAACCTATTCTTAGGCCTAGCGGTAAAAGATTCTCAAAC 

TGGAAAATATAAACCAGGACTTGCAAAAAGTTGGAATATTTCTGAAGATGGAATTATTTACACATTTAACCTAAGA 

GAAGATATAGTTTGGAGCGATGGAGTTGCCATTACTGCCGAGGAGATAAAAAAATCATACCTAAGAATTTTAAATA 

AAAAAACAGCTGCAATGTATGCTAATTTAATAAAATCTACAATAAAAAATGCACAAGAATATTTCGATGAGACA^ 

GCCTGAATCTGAGCTTGGCATAAAGGCTATTGACAGCAAAACCTTAGAGATAACATTAACATCTCCAAAGCCTTAT 

TTTCCTGATATGCTAACACACTCAGCATACATACCAGTTCCAATGCATATTGTTGAAAAATATGGAGAAAATTGGA 

CAAATCCTGAAAATATAGTTGTTAGTGGCGCATACAAACTTAAAGAAAGATCAATTAACGATAAAATCGTAATAGA 

AAAAAATGAAAAATACTATAATGCAAAAAATGTAGAAATTGATGAAGTAATATTTTACCCAACAGAAGGTAGCGTG 

GCTTACAATATGTACATAAACGGTGAACTCGATTTTCTACAAGGAGCAGAAAAGAATAATTTAGAAGAAATTAAAA 

TAAGAGATGATTATTATTCTGGGTTAAAAAACGGAATGGCATACATAGCATTCAATACAACAATAAAACCACTAGA 

CAATTTAAAAGTTAGACAAGCCATCTCCCTTGCCATTGACAGAGAAACTTTAACTAAAGTAGTTTTAAAGGGAAGT 

TCAGATCCAACAAGAAATCTAACTCCAAAATTTGATGATTATTCTTATGGAAAAAATTTAATACTATTTGAT^ 

AGAATGCAAAAAAACTTTTAGCTGAAGCTGGATATCCGGATGGGAAAGGATTCCCCACATTAA^ 

GGAGGGAAGACCAACAACAGCAGAATTTTTGCAAGAACAATTTAAAAAAATACTAAACATTAACTTA 

AATGAAGAATGGACAACATTCCTAGGAAGCAGAAGAACTGGAAATTACCAAATGTCAAGCGTGG 

ATTATTTTGATCCCTTAACATTCTTAGACAGCTTATTTACAACAGAAAATCATTTTTTAG^ 
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AAACAAAGAGTATGATGCTTTAATAAAAAAATCTAATTTTGAACTTGATCCAATAAAAAGACAAGACATTTTAAGA 
CAAGCTGAAGAGATAATAGCAGAAAAAGACTTTCCTATGGCACCTTTATATATACCCAAATCTCATTATCTTTTCA 
GAAATGATAAATGGACAGGGTGGGTACCAAATATCGCAGAAAGCTATTTATATGAAGATATTAAAACTAAAAAATA 
A 

f 611.aa 

MKKIFLFLFISFYLFGFEDSSLKIGIDDVYVEAHEEGFHLFIRKKPAIKSVILTESFEIPDKKKDVATYSFRTLSY 
NKVNGDEIRILNGRVIKNKELLSLTSSTPVPNKKFGEAFHILIPKKLKYGFPNFSTRSGDIDLEVLKSKKEPFWFS 
IRSFEKKYNDYLGRYQDNAYELLFKDDQNQGKIEFNELKDTFTKFSDEWIANNGIDIVDKINKILKNSEDSVYDL 
DLVIiWDVTDSMKSNI E I LKEHLFS 1 1 EPQLQKFKS YRIGLVFYKDYLEDFLTKAFDFNTI PYLNNI LKYVNVGGG 
GDYPEAVFEGIDAAVTQFDWRAERRFIIVIGDAPPHEYPRGSIVYKDVINSAKEKDITIYGIIFQ 

t611 .aa 

FEDSSLKIGIDDVYVEAHEEGFHLFIRKKPAIKSVILTESFEIPDKKKDVATYSFRTLSYNKVNGDEIRILNGRVI 
KNKELLSLTSSTPVPNKKFGEAFHILIPKKLKYGFPNFSTRSGDIDLEVLKSKKEPFWFSIRSFEKKYNDYLGRYQ 
DNAYELLFKDDQNQGKIEFNELKDTFTKFSDEVVIANNGIDIVDKINKILKNSEDSVYDLDLVLVVDVTDSMKSNI 
EILKEHLFSIIEPQLQKFKSYRIGLVFYKDYLEDFLTKAFDFNTIPYLNNILKYVNVGGGGDYPEAVFEGIDAAVT 
QFDWRAERRFIIVIGDAPPHEYPRGSIVYKDVINSAKEKDITIYGIIFQ 

feil.nt 

ATGAAGAAAATTTTTTTATTTCTTTTTATTAGTTTTTATTTGTTTGGATTTGAAGATAGTTCTTTGAAAATAGGTA 
TTGATGATGTTTATGTTGAGGCTCATGAAGAGGGATTTCATCTTTTTATTAGAAAAAAACCTGCAATCAAATCAGT 
AATATTGACAGAGTCTTTTGAAATTCCTGATAAGAAAAAAGATGTGGCTACTTATTCATTTCGTACATTAAGTTAT 
AATAAGGTTAATGGAGATGAAATTCGGATTTTAAATGGAAGAGTTATTAAGAATAAAGAACTTTTATCATTGACAT 
CTTCCACCCCTGTTCCTAATAAAAAGTTTGGAGAAGCTTTTCATATATTGATTCCAAAAAAATTAAAATATGGATT 
TCCAAATTTTTCAACAAGAAGTGGTGATATTGACTTAGAAGTATTAAAAAGTAAAAAAGAGCCCTTTTGGTTTTCT 
ATAAGATCTTTTGAGAAAAAATATAATGATTATTTGGGCAGATATCAAGACAATGCTTATGAATTGCTTTTCAAGG 
ATGATCAAAATCAGGGAAAAATTGAATTTAATGAATTAAAAGATACTTTTACAAAATTTTCAGATGAGGTTGTTAT 
TGCTAATAATGGCATTGATATTGTTGATAAAATAAACAAAATTTTAAAAAACTCAGAAGATTCAGTTTATGATTTA 
GATTTAGTGCTTGTTGTTGATGTTACTGATAGTATGAAAAGCAATATTGAGATTCTAAAAGAGCATTTGTTTTCAA 
TAATAGAACCTCAACTTCAAAAGTTTAAATCCTACAGAATAGGTCTTGTTTTTTATAAAGACTATCTTGAAGATTT 
TTTAACCAAAGCTTTTGATTTTAATACTATTCCTTATTTAAATAATATTCTTAAGTATGTTAATGTTGGTGGCGGT 
GGGGATTATCCAGAAGCTGTTTTTGAGGGGATTGATGCTGCTGTGACCCAATTTGATTGGCGGGCAGAAAGAAGG^ 
TTATTATTGTTATAGGAGATGCACCTCCTCATGAGTATCCAAGAGGGTCTATTGTTTATAAAGATGTTATCAATTC 
TGCAAAGGAAAAAGATATTACAATTTATGGAATAATATTTCAGTAA 

t6ll .nt 

TTTGAAGATAGTTCTTTGAAAATAGGTATTGATGATGTTTATGTTGAGGCTCATGAAGAGGGATTTCATCTTTTTA 

TTAGAAAAAAACCTGCAATCAAATCAGTAATATTGACAGAGTCTTTTGAAATTCCTGATAAGAAAAAAGATGTGGC 

TACTTATTCATTTCGTACATTAAGTTATAATAAGGTTAATGGAGATGAAATTCGGATTTTAAATGGAAGAGTTATT 

AAGAATAAAGAACTTTTATCATTGACATCTTCCACCCCTGTTCCTAATAAAAAGTTTGGAGAAGCTTTTCATATAT 

TGATTCCAAAAAAATTAAAATATGGATTTCCAAATTTTTCAACAAGAAGTGGTGATATTGACTTAGAAGTATTA^ 

AAGTAAAAAAGAGCCCTTTTGGTTTTCTATAAGATCTTTTGAGAAAAAATATAATGATTATTTGGGCAGATATCAA 

GACAATGCTTATGAATTGCTTTTCAAGGATGATCAAAATCAGGGAAAAATTGAATT^ 

TTACAAAATTTTCAGATGAGGTTGTTATTGCTAATAATGGCATTGATATTGTTGATAAAATAAACAA^ 

AAACTCAGAAGATTCAGTTTATGATTTAGATTTAGTGCTTGTTGTTGATGTTACTGATAGTATGAAAAGCAAT^ 

GAGATTCTAAAAGAGCATTTGTTTTCAATAATAGAACCTCAACTTCAAAAGTTTAAATCCTACAGAATAGGTCTTG 

TTTTTTATAAAGACTATCTTGAAGATTTTTTAACCAAAGCTTTTGATTTTAATACTATTCCTTATTTAAATAATAT 

TCTTAAGTATGTTAATGTTGGTGGCGGTGGGGATTATCCAGAAGCTGTTTTTC 

CAATTTGATTGGCGGGCAGAAAGAAGGTTTATTATTGTTATAGGAGATC 

CTATTGTTTATAAAGATGTTATCAATTCTGCAAAGGAAAAAGATATTACAATTTATGG^ 
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f617.aa 

MIFFRNSFMALIFSFSILSISYFFGDFFQFSYIKMISWRFILFLIMATGIATCAKSNSLNLGNEGQIYFGAFLVYI 

FSSFFGLTYFNFVFLILLSSFFVGLLGLIPFFITFFFGUSTKALTGLLISYGNQRLVIXSFILNMLKTGSFSNQT^ 

NSLFALDSSLIYLFLLGVSWLFYVFIHKKTIYGLQLEILSNKKKIDIFFNINEFKYKFFAVFGSAFLNGLAGSMF 

VWFRPYLVLGLTSGLGWSSLIVAVISGFNYVYVLFFSLLFSILIEFmFI^INYDFKYEFIGLCQSIAIFISLFL 

IKARKK 

t617.aa 

AKSNSLNLGNEGQIYFGAFLVYIFSSFFGLTYFNFVFLILLSSFFVGLLGLIPFFITFFFGLNKALTGLLISYGNQ 
RLVDGFILNMLKTGSFSNQTKRINSLFALDSSLIYLFLLGVSVWLFYVFIHKKTIYGLQLEILSNKKKIDIFFNIN 
EFKYKFFAVFGSAFLNGLAGSMFVVFFRPYLVLGLTSGLGWSSLIVAVISGFNYVYVLFFSLLFSILIEFNNFLNI 
NYDFKYEF IGLCQS I A I F I S LFLI KARKK 

£617. nt 

ATGATCTTTTTTAGAAATAGCTTTATGGCATTAATTTTTTCTTTTTCAATATTAAGTATTAGCTATTTTTTCGGTG 
ATTTTTTTCAATTTTCTTATATTAAAATGATATCTTGGCGCTTTATTTTATTTTTAATTATGGCTACGGGGATTGC 
TACTTGTGCCAAGAGTAATTCATTAAATCTTGGGAATGAAGGTCAGATTTATTTTGGGGCATTTTTAGTTTATATA 
TTTTCAAGTTTTTTTGGATTAACCTATTTTAATTTTGTATTTTTGATACTTTTAAGTTCTTTTTTTGTAGGACTTT 

AAATCAAAGATTGGTGGATGGATTTATTTTAAATATGTTAAAAACAGGTAGTTTTTCTAATCAGACAAAAAGGATT 
AATAGTTTGTTTGCTTTAGATTCATCACTTATTTACTTGTTTTTGCTTGGTGTATCAGTTTGGCTTTTTTATGTTT 
TTATTCACAAAAAAACTATTTATGGTCTTCAGCTTGAAATATTAAGCAATAAAAAAAAGATAGACATTTTTTTCAA 
TATAAATGAATTTAAATATAAGTTTTTCGCTGTATTTGGCAGTGCTTTTTTAAATGGTCTTGCAGGTTCTATGTTT 
GTAGTGTTTTTTAGACCATATTTGGTTTTAGGGCTAACTTCAGGACTTGGTTGGAGTAGTCTAATTGTTGCTGTAA 
TTTCAGGATTTAATTATGTTTATGTATTATTTTTTAGCTTATTGTTTTCAATATTAATTGAATTTAATAATTTTCT 
TAATATAAATTATGACTTTAAGTATGAATTTATTGGGCTTTGTCAATCAATTGCTATTTTTATCTCTTTATTTTTG 
ATTAAAGCTAGGAAAAAGTAG 

t617.nt 

GCCAAGAGTAATTCATTAAATCTTGGGAATGAAGGTCAGATTTATTTTGGGGCATTTTTAGTTTATATATTTTCAA 
GTTTTTTTGGATTAACCTATTTTAATTTTGTATTTTTGATACTTTTAAGTTCTTTTTTTGTAGGACTTTTGGGGCT 
TATCCCCTTTTTTATTACTTTTTTCTTCGGATTAAATAAAGCCTTAACAGGTCTTTTAATATCTTATGGAAATCAA 
AGATTGGTGGATGGATTTATTTTAAATATGTTAAAAACAGGTAGTTTTTCTAATCAGACAAAAAGGATTAATAGTT 
TGTTTGCTTTAGATTCATCACTTATTTACTTGTTTTTGCTTGGTGTATCAGTTTGGCTTTTTTATGTTTTTATTCA 
CAAAAAAACTATTTATGGTCTTCAGCTTGAAATATTAAGCAATAAAAAAAAGATAGACATTTTTTTCAATATAAAT 
GAATTTAAATATAAGTTTTTCGCTGTATTTGGCAGTGCTTTTTTAAATGGTCT^ 

TTTTTAGACCATATTTGGTTTTAGGGCTAACTTCAGGACTTGGTTGGAGTAGTCTAATTGTTGCTGTAATTTCAGG 
ATTTAATTATGTTTATGTATTATTTTTTAGCTTATTGTTTTCAATATTAATTGAATTTAATAATTTTCTTi^ 
AATTATGACTTTAAGTATGAATTTATTGGGCTTTGTCAATCAATTGCTATTTTTATCTCTTTATTTTTGATTAAAG 
CTAGGAATUU^GTAG 

f 631.aa 

MVVEINSLRTCYLLVLLLLVAYGLVVFYTSSFFLSLELTGNPNFLFFTRLNYLFLSFMVFLVFERISLNFLKKSIF 

PVLIITLFLIMATFLSPSISGAKRWIFFQGVSIQPSEIFKISFTIYLSAYLSKFDPRKNNGISYWIKPMLIFAIFW 

VLIILQNDYSTAIYFAILFFIVLFVSNl^FSYVFAIVVTFLPVSAIFIJldLEPYRVSRIFAFLNPYD^^ 

ASIJSIALKSGGILGKGLGMGEVKLGKLPEANSDFIFSVLGEELGFI^VLFAISLFFLFFYFGYFIAIHSNSRFKFFI 

AFISSLAIFLQSMMNILIAIGLLPPTGINLPFFSSGGSSIIVTMAIiSGLISNVSKNLSNN 



t631.aa 
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RISLNFLKKSIFPVLIITLFLIMATFLSPSISGAKRWIFFQGVSIQPSEIFKISFTIYLSAYLSKFDPRKNNGISY 
WIKPMLIFAIFWVLIILQNDYSTAIYFAILFFIVLFVSNMAFSYVFAIWTFLPVSAIFIJ^ 
YDDPSGKGYQIIASLNALKSGGILGKGIXSMGEVKLGKLPEANSDFIFSVLGEELGFIXSVLFAISLFFL 
AIHSNSRFKFFIAFISSLAIFLQSMMNILIAIGLLPPTGINLPFFSSGGSSIIVTMALSGLISNVSKNLSNN 

f631.nt 

ATGGTTGTAGAGATAAATTCACTTAGGACATGTTATTITGCTTGTTTTGCTGCTATT^ 

TTTTTTATACTTCTTCCTTTTTTCTAAGCTTAGAATTGACAGGTAATCCAAATTTTTTATTTTTCACAAGA^ 
TTATCTTTTTTTAAGTTTTATGGTTTTTCTTGTTTTTGAAAGGATTTCTTTAAATTTTTTAA 

CCTGTATTGATTATAACTCTTTTTTTAATTATGGCAACTTTTTTATCTCCAAGTATTTCTGGAGCAAAGAGATGGA 

TATTCTTTCAAGGTGTTAGCATTCAACCTTCTGAGATTTTTAAAATATCTTTTACTATTTATCTTTCAGCTTATTT 

GAGCAAGTTTGACCCAAGAAAAAACAATGGTATTTCATACTGGATAAAGCCAATGTTGATTTTTGCAATTTTTT^ 

GTGTTAATAATTTTGCAAAACGATTATTCAACAGCTATTTATTTTGCCATTCTTTTTTTTATTGTTTTGTTTGTTT 

CTAATATGGCATTTAGCTATGTTTTTGCTATTGTGGTTACTTTTTTACCAGTTTCTGCTATATTCTTGATGCTTGA 

ACCTTATAGGGTTTCTAGAATTTTTGCCTTTCTCAATCCTTACGATGATCCTTCTGGCAAAGGTTACCAGATAATA 

GCATCTCTTAATGCTTTAAAAAGTGGAGGAATTTTAGGTAAAGGGCTGGGAATGGGAGAGGTAAAACTTGGAAAAT 

TACCAGAGGCCAATTCGGATTTTATTTTTTCAGTTCTTGGAGAAGAATTAGGATTTTTAGGGGTTTTGTTTGCTAT 

AAGCTTGTTTTTTTTGTTTTTTTACTTTGGTTATTTTATAGCTATTCATTCTAATAGTAGGTTTAAATTTTTTATT 

GCATTTATTTCAAGTCTTGCAATTTTTCTTCAAAGCATGATGAATATTTTAATTGCAATCGGTCTTTTGCCTCCTA 

CAGGGATAAATTTACCATTTTTTTCATCTGGGGGATCTTCTATTATTGTTACCATGGCATTGTCTGGCCTTATTTC 

AAATGTTTCAAAAAATTTAAGTAATAATTGA 

t631 .nt 

AGGATTTCTTTAAATTTTTTAAAAAAATCAATATTTCCTGTATTGATTATAACTCTTTTTTTAATTATGGCAACTT 
TTTTATCTCCAAGTATTTCTGGAGCAAAGAGATGGATATTCTTTCAAGGTGTTAGCATTCAACCTTCTGAGATTTT 
TAAAATATCTTTTACTATTTATCTTTCAGCTTATTTGAGCAAGTTTGACCCAAGAAAAAACAATGGTATTTCATAC 
TGGATAAAGCCAATGTTGATTTTTGCAATTTTTTGGGTGTTAATAATTTTGCAAAACGATTATTCAACAGCTATTT 
ATTTTGCCATTCTTTTTTTTATTGTTTTGTTTGTTTCTAATATGGCATTTAGCTATGTTTTTGCTATTGTGGTTAC 
TTTTTTACCAGTTTCTGCTATATTCTTGATGCTTGAACCTTATAGGGTTTCTAGAATTTTTGCCTTTCTCAATCCT 
TACGATGATCCTTCTGGCAAAGGTTACCAGATAATAGCATCTCTTAATGCTTTAAAAAGTGGAGGAATTTTAGGTA 
AAGGGCTGGGAATGGGAGAGGTAAAACTTGGAAAATTACCAGAGGCCAATTCGGATTTTATTTTTTCAGTTCTTGG 
AGAAGAATTAGGATTTTTAGGGGTTTTGTTTGCTATAAGCTTGTTTTTTTTGTTTTTTTACTTTGGTTATTTTATA 
GCTATTCATTCTAATAGTAGGTTTAAATTTTTTATTGCATTTATTTCAAGTCTTGCAATTTTTCTTCAAAGCATGA 
TGAATATTTTAATTGCAATCGGTCTTTTGCCTCCTACAGGGATAAATTTACCATTTTTTTCATCTGGGGGATCTTC 
TATTATTGTTACCATGGCATTGTCTGGCCTTATTTCAAATGTTTCAAAAAATTTAAGTAATAATTGA 

f647.aa 

MKVNNFLSFFFRAFFLLFLIVILFFFVLFFIDFIGMYNTKRYFPEFVRTKLLGETSLVFDHNSNIILDE^ 

EAIDIKNQQIEKLKEDLKLKEDSIJSrKLEFELKQKQKDLDLKQKIIDDIINKYNDEEANILQTAVYLt^ 

RLEDIiNPEI^ISYMRKIEELSKKEGRLSIVPYWLSLMDSKKAAILIRKMSVSSLE 

t647 . aa 

IDFIGMYlOTKRYFPEFVRTKLLGETSLVFDPINSNIILDEARLVKEREAIDIKNQQIEKLKEDLKLKEDSLNKIiEFE 
LKQKQKDLDLKQKI I DDI INKYNDEEANI LQTAVYLMNMPPEDAVKRLEDLNPELAI S YMRKI EELSKKEGRLS IV 
PYWLSLMDSKKAAILIRKMSVSSLE 

f647,nt 

ATGAAAGTGAATAATTTTTTATCGTTCTTTTTTAGGGCATTTTTTT^ 

TTGTATTATTCTTTATTGATTTTATTGGAATGTATAATACTAAAAGATATTTCCCCGAATT 

GTTAGGAGAAACTTCTCTGGTCTTTGATCATAATTCTAATATAATTCTTGATG 

GAAGCTATTGATATTAAGAATCAGCAGATTGAAAAGCTTAAAGAAGATCTAAAGTTAAAAGAAGACAGTTTAA^ 
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AGCTTGAATTTGAGCTTAAGCAAAAGCAGAAAGATTTAGATTTAAAACAAAAAATAATAGATGACATTATAAATAA 
■ ATATAATGATGAGGAAGCAAATATTTTGCAAACAGCTGTATATTTAATGAATATGCCACCAGAAGATGCTGTTAAG 
CGGCTTGAAGATTTAAATCCCGAGCTTGCAATATCTTATATGCGGAAAATTGAAGAGCTTTCCAAAAAAGAAGGTC 
GTTTATCAATTGTTCCTTATTGGTTATCTCTTATGGATTCTAAAAAAGCTGCTATATTGATTAGAAAAATGTCTGT 
TAGTTCATTGGAGTAG 

t647 .nt 

ATTGATTTTATTGGAATGTATAATACTAAAAGATATTTCCCCGAATTTGTAAGAACCAAGTTGTTAGGAGAAACTT 

CTCTGGTCTTTGATCATAATTCTAATATAATTCTTGATGAAGCTAGACTTGTGAAGGAAAGAGAAGCTATTGATAT 

TAAGAATCAGCAGATTGAAAAGCTTAAAGAAGATCTAAAGTTAAAAGAAGACAGTTTAAATAAGCTTGA^ 

CTTAAGCAAAAGCAGAAAGATTTAGATTTAAAACAAAAAATAATAGATGACATTATAAATAAATATAATGATGAGG 

AAGCAAATATTTTGCAAACAGCTGTATATTTAATGAATATGCCACCAGAAGATGCTGTTAAGCGGCTTGAAGATTT 

AAATCCCGAGCTTGCAATATCTTATATGCGGAAAATTGAAGAGCTTTCCAAAAAAGAAGGTCGTTTATCAATTGTT 

CCTTATTGGTTATCTCTTATGGATTCTAAAAAAGCTGCTATATTGATTAGAAAAATGTCTGTTAGTTCATTGGAGT 

AG 

f653.aa 

MLTYGDMVTLLLVFFVTMFSLNDIIFQENVIRIMSASFTGAGFFKGGKTLDFSKLSYLSNSFMSLPSTVRNKQASQ 

TAKNKSMIEFIEKIQSKNIWRQEERGIVISIiAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGH 

TDNIDTDVNGPWKS^mELSAARSV]SmLEHIUS^y'LDQSDVKRIENNFEVSGF 

TTDASLSFPKEIKQ 

t653 .aa 

NDIIFQENVIRIMSASFTGAGFFKGGKTLDFSKLSYLSNSFMSLPSTVRNKQASQTAKNKSMIEFIEKIQSKNIW 
RQEERGIVISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWKSNWELSAA 
RSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDTPEGRAYNRRIDILITTDASLSFPKEIKQ 

f653.nt 

ATGTTGACTTATGGAGACATGGTTACTTTGCTGCTTGTGTTTTTTGTTACAATGTTTTCATTAAATGATATTATTT 

TTCAAGAAAATGTGATAAGAATAATGTCTGCTTCTTTCACGGGTGCTGGATTTTTCAAGGGCGGTAAAACTTTAGA 

TTTTAGTAAATTATCTTATTTGAGTAATAGCTTTATGTCTTTGCCTTCTACTGTGCGCAATAAACAAGCATC 

ACTGCTAAAAATAAATCCATGATTGAATTTATTGAGAAGATTCAGTCTAAAAATATTGTAGTTAGGCAAGAAGAAA 

GAGGTATTGTAATATCTCTTGCAGCAGATGCATTTTTTGATTCTGCTAGTGCAGATGTTAAGCTTGAAGAGAATAG 

AGATTCTATTCAAAAAATAGCATCTTTTATTGGCTTTTTAAGTCCTAGAGGCTATAATTTTAAAATAGA^ 

ACAGATAATATTGATACTGATGTAAATGGACCTTGGAi\AAGCAATTGGGAACTTTCGGCTGCTAGATCTGTTAATA 

TGCTGGAACATATTTTGAACTATTTAGATCAATCTGATGTTAAAAGAATTGAAAATAATTTTGAAGTATCTGGTTT 

TGGTGGAAGTAGGCCTATTGCAACAGACGATACCCCTGAGGGTAGGGCTTATAATAGAAGAATTGATATATTAATT 

ACTACAGATGCATCTTTAAGTTTCCCTAAGGAAATTAAGCAGTAA 

t653.nt 

AATGATATTATTTTTCAAGAAAATGTGATAAGAATAATGTCTGCTTCTTTCACGGGTGCTGGATTTTTCAAGGGCG 

GTAAAACTTTAGATTTTAGTAAATTATCTTATTTGAGTAATAGCTTTATGTCTTTGCCTTCTACTGTGCGCAATAA 

ACAAGCATCTCAGACTGCTAAAAATAAATCCATGATTGAATTTATTGAGAAGATTCAGTCTAAAAATATTGTAGTT 

AGGCAAGAAGAAAGAGGTATTGTAATATCTCTTGCAGCAGATGCATTTTTTGATTCTGCTAGTGCAGATGTTAAGC 

TTGAAGAGAATAGAGATTCTATTCAAAAAATAGCATCTTTTATTGGCTTTTTAAGTCCTAGAGGCTATAATTTTAA 

AATAGAAGGGCATACAGATAATATTGATACTGATGTAAATGGACCTTGGAAAAGCAATTGGGAACTT 

AGATCTGTTAATATGCTGGAACATATTTTGAACTATTTAGATCAATCTGATGTTAAAAGAATTGi^^ 

AAGTATCTGGTTTTGGTGGAAGTAGGCCTATTGCAACAGACGATACCCCTGAGGGTAGGGCTTATAA^^ 

TGATATATTAATTACTACAGATGCATCTTTAAGTTTCCCTAAGGAAATTAAGCAGTAA 
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MRMSWTMGFAYIRSIMGYWLFFFASI^VNFFVNIIQVGFFITFKSLEPRWDKISFNFSRWAKNSFFSAGAFFNL 
FKSLLKWIICLIYYFIIENNIGKISKLSEYTLQSGISIVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKM 
TKEEVKQERKEMEGDPLLRSRIKERMRVILSTNLRVAIPQADWITNPEHFAVAIKWDSETMLAPKVLAKGQDEIA 
LTIKKI ARENIWPLMENKLLARALYAWKVNEEI PREYWEI VSKI LVRVYS ITKKFN 

t664 . aa 

FVNIIQVGFFITFKSLEPRWDKISFNFSRWAKNSFFSAGAFFNLFKSLLKWIICLIYYFIIENNIGKISKLSEYT 
LQSGISIVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKMTKEEVKQERKEMEGDPLLRSRIKERMRVILST 
NLRVAIPQADWITNPEHFAVAIKWDSETMI^PKVIJ^KGQDEIALTIKKIARENIWP^ 
EI PREYWEI VSKI LVRVYS ITKKFN 

£664. nt 

ATGCGTATGAGTGTTTATACTATGGGTTTTGCATATATTAGATCTATCATGGGGTATGTCGTTTTGTTTTTTTTCG 
CATCTTTAGCTGTTAATTTTTTTGTTAATATTATTCAAGTAGGCTTTTTTATTACTTTTAAATCTTTGGAGCCAAG 
GTGGGATAAAATTAGTTTTAATTTTTCCAGATGGGCAAAAAATTCTTTTTTTTCAGCAGGGGCTTTTTTCAATTTG 
TTTAAAAGTTTGTTAAAAGTTGTTATAATATGCTTGATATATTATTTTATTATAGAAAACAATATAGGCAAAATTT 
CTAAGCTTTCGGAGTATACACTTCAATCTGGAATTTCTATTGTGTTAGTGATTGCCTATAAGATATGTTTTTTTTC 
AGTAATGTTTTTGGCAATTGTAGGGGTGTTTGATTATTTGTTTCAAAGATCTCAGTACATTGAGAGTTTGAAAATG 
ACAAAAGAAGAGGTAAAGCAGGAAAGAAAGGAAATGGAAGGTGATCCTTTACTTCGATCTAGAATAAAAGAGAGAA 
TGAGGGTTATTTTAAGTACCAATTTAAGAGTAGCTATTCCTCAAGCAGATGTAGTAATTACAAATCCAGAACATTT 
TGCAGTTGCTATTAAATGGGATAGCGAAACAATGTTAGCTCCAAAGGTGCTTGCAAAAGGTCAAGATGAAATAGCT 
CTCACAATTAAAAAAATTGCAAGAGAAAATAATGTTCCTTTAATGGAAAATAAGCTCCTTGCAAGAGCTCTTTATG 
CTAATGTTAAGGTTAATGAAGAGATTCCAAGAGAATATTGGGAGATTGTTTCAAAAATTCTTGTGAGAGTATATTC 
TATTACTAAAAAGTTTAATTAG 

t664 .nt 

TTTGTTAATATTATTCAAGTAGGCTTTTTTATTACTTTTAAATCTTTGGAGCCAAGGTGGGATAAAATTAGTTTTA 
ATTTTTCCAGATGGGCAAAAAATTCTTTTTTTTCAGCAGGGGCTTTTTTCAATTTGTTTAAAAGTTTGTTAAAAGT 
TGTTATAATATGCTTGATATATTATTTTATTATAGAAAACAATATAGGCAAAATTTCTAAGCTTTCGGAGTATACA 
CTTCAATCTGGAATTTCTATTGTGTTAGTGATTGCCTATAAGATATGTTTTTTTTCAGTAATGTTTTTGGCAATTG 
TAGGGGTGTTTGATTATTTGTTTCAAAGATCTCAGTACATTGAGAGTTTG7UUVATGACAAAAGAAGAGGTAAAGCA 
GGAAAGAAAGGAAATGGAAGGTGATCCTTTACTTCGATCTAGAATAAAAGAGAGAATGAGGGTTATTTTAAGTACC 
AATTTAAGAGTAGCTATTCCTCAAGCAGATGTAGTAATTACAAATCCAGAACATTTTGCAGTTGCTATTAAATGGG 
ATAGCGAAACAATGTTAGCTCCAAAGGTGCTTGCAAAAGGTCAAGATGAAATAGCTCTCACAATTAAAAAAATTGC 
AAGAGAAAATAATGTTCCTTTAATGGAAAATAAGCTCCTTGCAAGAGCTCTTTATGCTAATGTTAAGGTTAATGAA 
GAGATTCCAAGAGAATATTGGGAGATTGTTTCAAAAATTCTTGTGAGAGTATATTCTATTACTAAAAAGTTTAATT 
AG 

f 680.aa 

MFTLSFVLINFIITGILILMLEFNFLKVDFKGNILIiAGIFMGLMQGLGALPGISRSGITIFSASVIGFNRKSAFEI 
SFLSLIPIVFGAILLKHKEFYDIFIWLNFFEINLGALVAFVVGIFSINFFFKMLNNKKLYYFSIYLFALSIIVCYF 
VRI 

t680.aa 

ITGILILMLEFNFLKVDFKGNILLAGIFMGLMQGLGALPGISRSGITIFSASVIGFNRKSAFEISFLSLIPIVFGA 
ILLJCHKEFYDIFMVLNFFEINLGALVAFWGIFSINFFFKMLNNKKLYYFSIYLFALSIIVCYFVRI 



f680.nt 
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ATGTTTACATTGTCTTTCGTTTTAATTAATTTTATTATAACAGGGATTTTAATCTTGATGCTAGAATTTAATTTTT 

TAAAAGTTGATTTTAAAGGTAATATTTTGTTAGCAGGAATTTTTATGGGGCTGATGCAAGGCTTGGGTGCGCTTCC 

AGGAATCTCTCGTTCAGGAATTACGATCTTTTCGGCATCGGTTATTGGATTTAATAGAAAAAGTGCATTTGAAATT 

TCATTTTTATCTTTAATTCCAATAGTTTTTGGAGCGATTTTATTAAAACATAAAGAATTTTATGATATTTTTATGG 

TTTTAAATTTTTTTGAAATAAACTTAGGAGCATTAGTTGCTTTTGTTGTTGGTATTTTCTCAATAAATT^ 

TAAAATGCTTAATAACAAAAAACTGTATTATTTTTCTATATATTTATTTGCACTTTCAATTATAGTTTGTTATTTT 

GTTAGAATATGA 

t680 .nt 

ATAACAGGGATTTTAATCTTGATGCTAGAATTTAATTTTTTAAAAGTTGATTTTAAAGGTAATATTTTGTTAGCAG 
GAATTTTTATGGGGCTGATGCAAGGCTTGGGTGCGCTTCCAGGAATCTCTCGTTCAGGAATTACGATCTTTTCGGC 
ATCGGTTATTGGATTTAATAGAAAAAGTGCATTTGAAATTTCATTTTTATCTTTAATTCCAATAGTTTTTGGAGCG 
ATTTTATTAAAACATAAAGAATTTTATGATATTTTTATGGTTTTAAATTTTTTTGAAATAAACTTAGGAGCATTAG 
TTGCTTTTGTTGTTGGTATTTTCTCAATAAATTTCTTTTTTAAAATGCTTAATAACAAAAAACTGTATTATTTTTC 
TATATATTTATTTGCACTTTCAATTATAGTTTGTTATTTTGTTAGAATATGA 

f 688 . aa 

MIVLLISIGCANAVHIINEIFKLIKKEQLSKESIKATIKKLKTPILLTSFTTAFGFLSLTTSSINAYKTMGIFMSI 
GVIISMIISLTVLPGIITLIPFAKKKSFEKEKENKLNKISFLERLAKLNTQITKSILKRKYTSSIMVLIILGISFV 
GLLKIEINFDEKDYFKESTSVKKTLNLMQKEMGGISIFKIEIEGRPGEFKNAKAMQILDLITDKLDAFSAKTQSSS 
INGILKFTNFKIKKESPLEYKLPENKIILNKLINLIDKSDWTKDNKRMYINDDWSLISIIVRIEDNSTEGIKKFEK 
YAINTINEYMKNNKYHFSGVYDKVLI AKTMVKEQVINI ITTLGSITLLLMFFFKSIKTGI 1 1 AI PVAWSVFLNFAV 
MRLFGITLNPATATIASVSMGVGVDYSIHFFNTFILQYQKNQIYKTALLESIPNVFNGIFANSISVGIGFLTLTFS 
SYKIISTLGAIIAFTMLTTSLASLTLLPLLIYLFKPRVKLASNNNFKKLKQZ 

t688 . aa 

YKTMGIFMSIGVIISMIISLTVLPGIITLIPFAKKKSFEKEKENKLNKISFLERLAKLNTQITKSILKRKYTSSIM 
VLIILGISFVGLLKIEINFDEKDYFKESTSVKKTLNLMQKEMGGISIFKIEIEGRPGEFKNAKAMQILDLITDKLD 
AFSAKTQSSSINGILKFTNFKIKKESPLEYKLPENKIILNKLINLIDKSDWTKDNKRMYINDDWSLISIIVRIEDN 
STEGIKKFEKYAINTINEYMKNNKYHFSGVYDKVLIAKTMVKEQVINIITTLGSITLLLMFFFKSIKTGIIIAIPV 
AWSWLNFAVMRLFGITLNPATATIASVSMGVGVDYSIHFFNTFILQYQKNQIYKTALLESIPNVFNGIFANSISV 
GIGFLTLTFSSYKIISTLGAIIAFTMLTTSLASLTLLPLLIYLFKPRVKLASNNNFKKLKQZ 

f688.nt 

ATGATTGTTTTACTTATTTCAATCGGATGCGCCAATGCTGTACATATAATAAATGAAATATTTAAATTAATA/^AAA 

AAGAACAGCTCTCAAAAGAATCCATAAAAGCAACAATTAAAAAACTTAAAACACCCATCCTGCTAACATCTTTTAC 

AACTGCATTTGGATTTTTATCTCTTACAACCTCTTCAATTAATGCCTACAAAACAATGGGTATTTTCATGTCAATT 

GGAGTAATTATCTCAATGATAATCTCATTAACCGTTTTACCTGGAATAATAACATTAATCCCATTTGCAAAAAAAA 

AGTCTTTTGAAAAAGAAAAAGAAAATAAACTAAATAAAATATCCTTCCTTGAAAGACTTGCCAAACTAAATACGCA 

AATAACAAAATCTATATTAAAAAGAAAATATACATCCTCTATAATGGTCCTCATCATACTGGGAATTTCTTTTGTA 

GGTCTTTTAAAAATCGAAATCAATTTTGATGAAAAAGATTACTTTAAAGAAAGCACAAGTGTAAAAAAA^ 

ACCTAATGCAAAAAGAAATGGGGGGAATATCGATTTTCAAAATAGAAATTGAAGGCAGGCCCGGTGAATTTAAAAA 

TGCTAAAGCAATGCAAATATTAGACTTAATTACAGATAAGCTTGATGCATTTTCTGCAAAAACTCAATCTAGTTCT 

ATTAATGGCATTTTAAAATTTACAAATTTTAAAATTAAAAAAGAATCCCCACTAGAGTATAAACTGCCTGAAAATA 

AAATTATACTAAACAAACTAATAAATTTGATAGATAAAAGCGATTGGACTAAGGACAATAAAAGAATGTACATTAA 

CGATGACTGGTCATTAATATCTATCATAGTAAGAATTGAAGACAACTCAACCGAAGGAATAAAAAAATTTGAAAAA 

TATGCTATTAACACAATTAATGAATATATGAAAAATAATAAATATCATTTCTCAGGTGTTTATGATAAGGTATTAA 

TAGCTAAAACAATGGTAAAAGAACAGGTTATAAACATTATAACAACTCTTGGATCAATAACACTACTACTTATGTT 

TTTCTTTAAATCTATAAAAACCGGAATAATTATTGCAATCCCAGTAGCATGGTCAGTGTTTTTAAACTTTGCTC 

ATGAGATTATTTGGGATAACCTTAAACCCCGCAACGGCAACAATTGCATCTGTAAGCATGGGAGTAGGA 

ATTCAATTCATTTTTTCAATACATTTATTTTACAATACCAAAAAAATCAAATCTACAAAACTGCACTTCTTC 

AATACCCAATGTATTTAATGGAATATTTGCAAATTCTATTTCTGTTGGAATAGGATTTTTAACTCT 
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TCTTATAAAATAATATCAACTCTTGGAGCAATAATTGCTTTTACAATGCTAACGACATCTCTTGCATCACTAACTC 
TTCTTCCATTATTAATTTATTTATTTAAACCTAGAGTAAAGCTAGCCTCAAACAACAATTTTAAAAAATTAAAAC^ 
ATAA 

t688 .nt 

TACAAAACAATGGGTATTTTCATGTCAATTGGAGTAATTATCTCAATGATAATCTCATTAACCGTTTTACCTGGAA 
TAATAACATTAATCCCATTTGCAAAAAAAAAGTCTTTTGAAAAAGAAAAAGAAAATAAACTAAATAAAATATCCTT 
CCTTGAAAGACTTGCCAAACTAAATACGCAAATAACAAAATCTATATTAAAAAGAAAATATACATCCTCTATAATG 
GTCCTCATCATACTGGGAATTTCTTTTGTAGGTCTTTTAAAAATCGAAATCAATTTTGATGAAAAAGATTACTTTA 
AAGAAAGCACAAGTGTAAAAAAAACATTAAACCTAATGCAAAAAGAAATGGGGGGAATATCGATTTTCAAAATAGA 
AATTGAAGGCAGGCCCGGTGAATTTAAAAATGCTAAAGCAATGCAAATATTAGACTTAATTACAGATAAGCTTGAT 
GCATTTTCTGCAAAAACTCAATCTAGTTCTATTAATGGCATTTTAAAATTTACAAATTTTAAAATTAAAAAAGAAT 
CCCCACTAGAGTATAAACTGCCTGAAAATAAAATTATACTAAACAAACTAATAAATTTGATAGATAAAAGCGATTG 
GACTAAGGACAATAAAAGAATGTACATTAACGATGACTGGTCATTAATATCTATCATAGTAAGAATTGAAGACAAC 
TCAACCGAAGGAATAAAAAAATTTGAAAAATATGCTATTAACACAATTAATGAATATATGAAAAATAATAAATATC 
ATTTCTCAGGTGTTTATGATAAGGTATTAATAGCTAAAACAATGGTAAAAGAACAGGTTATAAACATTATAACAAC 
TCTTGGATCAATAACACTACTACTTATGTTTTTCTTTAAATCTATAAAAACCGGAATAATTATTGCAATCCCAGTA 
GCATGGTCAGTGTTTTTAAACTTTGCTGTAATGAGATTATTTGGGATAACCTTAAACCCCGCAACGGCAACAATTG 
CATCTGTAAGCATGGGAGTAGGAGTAGATTATTCAATTCATTTTTTCAATACATTTATTTTACAATACCAAAAAAA 
TCAAATCTACAAAACTGCACTTCTTGAATCAATACCCAATGTATTTAATGGAATATTTGCAAATTCTATTTCTGTT 
GGAATAGGATTTTTAACTCTAACATTTTCGTCTTATAAAATAATATCAACTCTTGGAGCAATAATTGCTTTTACAA 
TGCTAACGACATCTCTTGCATCACTAACTCTTCTTCCATTATTAATTTATTTATTTAAACCTAGAGTAAAGCTAGC 
CTCAAACAACAATTTTAAAAAATTAAAACAATAA 

f704.aa 

MNYTKFQEFISEFLGTFILLALGTGSVAMTVLFSSSPEIPGEIIKGGYTNIVFGWGLGVTFGIYTAARMSGAHLNP 
AVSIGLASVGKFPVSKLLHYIVAQILGAFTGALMTLWFYPKWIEMDPGLENTQGIMATFPAVPGFLPGFIDQIFG 
TFLLMFLISWGDFTKKHSDNPFIPFIVGAWLSIGISFGGMNGYAINPARDLGPRILLLFAGFKNHGFNNLiSIVI 
VPIIGPIIGAILGATIYEFTLKNNKD 

t7 04 . aa 

GEIIKGGYTNIVFGWGLGVTFGIYTAARMSGAHLNPAVSIGLASVGKFPVSKLLHYIVAQILGAFTGALMTLWFY 
PKWIEMDPGLENTQGIMATFPAVPGFLPGFIDQIFGTFLLMFLISWGDFTKKHSDNPFIPFIVGAWLSIGISFG 
GMNGYAINPARDLGPRILLLFAGFKNHGFNNLSIVIVPIIGPIIGAILGATIYEFTLKNNKD 

f704.nt 

ATGAATTATACAAAATTCCAAGAATTTATATCGGAATTTTTGGGAACATTTATCCTATTGGCTCTAGGAACTGGAT 

CTGTTGCAATGACAGTATTATTTTCCTCAAGTCCCGAAATACCAGGAGAAATAATAAAAGGAGGATATACA7VATAT 

AGTATTTGGATGGGGATTGGGTGTAACGTTTGGTATTTACACAGCAGCAAGAATGAGCGGAGCACACCTAAACCCA 

GCTGTTAGCATAGGATTAGCAAGTGTTGGAAAGTTTCCCGTTTCAAAACTTTTACATTACATTGTAGCACAAATAT 

TAGGAGCTTTTACAGGTGCATTAATGACACTTGTCGTATTTTATCCTAAATGGATAGAAATGGATCCTGGCTTAGA 

AAATACTCAAGGAATAATGGCAACTTTCCCTGCTGTTCCTGGATTTTTGCCTGGATTTATTGATCAAATTTT^ 

ACTTTTTTGCTAATGTTTTTAATTTCTGTTGTTGGAGATTTTACAAAAAAACACAGCGACAATCCATTTATTCCTT 

TTATTGTAGGAGCAGTGGTTTTATCAATAGGGATAAGTTTCGGAGGAATGAACGGTTATGCTATTAATCCTGCAAG 

GGATCTGGGACCAAGAATTTTACTCTTATTTGCTGGATTTAAAAATCACGGATTTAACAATCTAAGTATAGTTATT 

GTACCAATAATTGGCCCAATAATTGGAGCAATTTTGGGAGCTACAATTTACGAATTTACACTAAAAAATAACAAAG 

ACTAA 

t704.nt 

GGAGAAATAATAAAAGGAGGATATACAAATATAGTATTTGGATGGGGATTGGGTGTAACGTTT^ 
CAGCAAGAATGAGCGGAGCACACCTAAACCCAGCTGTTAGCATAGGATTAGCAAGTGTTGGAAAGTTTCCCGTTTC 
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AAAACTTTTACATTACATTGTAGCACAAATATTAGGAGCTTTTACAGGTGCATTAATGACACTTGTCGTATTTTAT 

CCTAAATGGATAGAAATGGATCCTGGCTTAGAAAATACTCAAGGAATAATGGCAACTTTCCCTGCTGTTCCTGGAT 

TTTTGCCTGGATTTATTGATCAAATTTTTGGAACTTTTTTGCTAATGTTTTTAATTTCTGTTGTTGGAGATTT^ 

AAAAAAACACAGCGACAATCCATTTATTCCTTTTATTGTAGGAGCAGTGGTTTTATCAATAGGGATAAGTTTCGGA 

GGAATGAACGGTTATGCTATTAATCCTGCAAGGGATCTGGGACCAAGAATTTTACTCTTATTTGCTGGATTTAAAA 

ATCACGGATTTAACAATCTAAGTATAGTTATTGTACCAATAATTGGCCCAATAATTGGAGCAATTTTGGGAGCTAC 

AATTTACGAATTTACACTAAAAAATAACAAAG 

ACTAA 

f707 .aa 

MRRLFLLYILCSFVFLNLFAQGSSSYIDKQKELAIFYYEVGQRYINVGKIKKGKLFQAKALKIYPDLKKGFDIKLA 
VKELDARIKDDNPKWMLEDIKLEEIPGIVHEKIEINDFTNAPKIEYIAQRERSKNQDKIIKFQFGKFARALISRN 
FDLFDSVIADKVNVMGQFESKNDFISTLSSASSKADADELEYLSVDDYYDLKSLKISKSNDTSFAV^^ 
KNFPFWKERQTLIFTTEDDNNWFLSSINZ 

t707 .aa 

MRRLFLLYILCSFVFIJ^LFAQGSSSYIDKQKELJ^IFYYEVGQRYIIWGKIKKGKLFQAKALKIYPDLKKGFDIKI^ 
VKELDARIKDDNPKWMLEDIKLEEIPGIVHEKIEINDFTNAPKIEYIAQRERSKNQDKIIKFQFGKFARALISRN 
FDLFDSVIADKV1TVMGQFESKNDFISTLSSASSKADADELEYLSVDDYYDLKSLKISKSNDTSFAVN\^ 
KNFPFWKERQTLIFTTEDDNNWFLSSINZ 

f707 .nt 

ATGAGAAGATTATTTCTTCTATATATTTTATGTTCTTTTGTTTTTTTGAATTTATTTGCTCAAGGTAGTTCTTCTT 

ATATTGATAAGCAAAAAGAGCTTGCTATTTTTTATTATGAGGTTGGTCAAAGATATATAAACGTTGGTAAAATTAA 

AAAAGGAAAGCTTTTTCAAGCAAAAGCTTTAAAGATTTATCCAGATTTGAAAAAGGGGTTTGATATCAAGCTTGCA 

GTTAAAGAGCTTGATGCTAGGATTAAAGATGACAATCCCAAGGTTGTTATGCTTGAGGATATTAAGCTTGAGGAGA 

TACCTGGAATAGTGCACGAAAAAATAGAAATCAATGATTTTACAAATGCTCCTAAAATAGAATATATTGCTCAAAG 

AGAGAGAAGCAAAAATCAAGATAAAATTATTAAGTTTCAATTTGGAAAGTTTGCAAGAGCTTTAATTTCTAGGAAC 

TTTGATTTGTTTGATTCAGTTATTGCGGATAAAGTTAACGTTATGGGTCAATTTGAATCAAAAAATGATTTTATAT 

CAACTTTATCAAGTGCTTCATCTAAGGCCGATGCTGATGAGTTAGAGTATTTATCAGTTGATGATTATTACGATTT 

AAAGTCTTTAAAAATTTCAAAATCCAACGATACTTCTTTTV3CTGTTAATGTTAATGCCAAAAAAAATGATGTTAC 

AAAAATTTTCCATTTTGGAAAGAACGTCAAACTTTAATTTTTACTACAGAGGATGATAATAATTGGTTTTTGTCTT 

CCATAAATTGA 

t707 .nt 

CAAGGTAGTTCTTCTTATATTGATAAGCAAAAAGAGCTTGCTATTTTTTATTATGAGGTTGGTCAAAGATATATAA 
ACGTTGGTAAAATTAAAAAAGGAAAGCTTTTTCAAGCAAAAGCTTTAAAGATTTATCCAGATTTGAAAAAGGGGTT 
TGATATCAAGCTTGCAGTTAAAGAGCTTGATGCTAGGATTAAAGATGACAATCCCAAGGTTGTTATGCTTGAGGAT 
ATTAAGCTTGAGGAGATACCTGGAATAGTGCACGAAAAAATAGAAATCAATGATTTTACAAATGCTCCTAAAATAG 
AATATATTGCTCAAAGAGAGAGAAGCAAAAATCAAGATAAAATTATTAAGTTTCAATTTGGAAAGTTTGCAAGAGC 
TTTAATTTCTAGGAACTTTGATTTGTTTGATTCAGTTATTGCGGATAAAGTTAACGTTATGGGTCAATTTGAATCA 
AAAAATGATTTTATATCAACTTTATCAAGTGCTTCATCTAAGGCCGATGCTGATGAGTTAGAGTATTTATCAGTTG 
ATGATTATTACGATTTAAAGTCTTTAAAAATTTCAAAATCCAACGATACTTCTTTTGCTGTTAATGTTAATGCCAA 
AAAAAATGATGTTACTAAAAATTTTCCATTTTGGAAAGAACGTCAAACTTTAATTTTTACTACAGAGGATGATAAT 
AATTGGTTTTTGTCTTCCATAAATTGA 

f709.aa 

MLIFGFIGLFFLNIFSLHAQGIXTTNKDAQEEFKWALNSYNNGIYDDALLSFKKILSFDPNNLOYHFWTGNVYY 
YVEEAIJyiEWRNLKDQGYKVPYLRHLISTIEQRRGIFSNYELNFKKLVKVASLDNSIYKRPHGYQITSLRADKYGGY 
YAANFVGNEILYFDVNNNVNALVKDGFSYLKSPYDVIEANNLLYVTLYSSDEIGVYDKVLGV^ 
LLAPQYMAIDKRNYIYVSEWGNKRVSKFGLEGDFILHFGSRTSGYKGLLGPTGVTYLNENIYVADSLRNTIEV^ 
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SGNHLYSVFTSIEGIEGIiSSDFVGNNVIVSSKrXSVYKYSIAKKTITKILKADKMNSKISSSILDAN^ 

AKVSVYKSDASLYDSLNVDTORIIRLGGPKIYVELNVSSKSGLPWGLKSENFSISNENYYI\^ 

INIAWFDKSSYMKKYDTDQIVGLNALMELSKNKNFSFINATSVPIIDNIESLTNSIRNTSSLGPYSTDAVKTDVS 

LKLAGSGLMSKSSRRAVVYFSGGILNRKAFEKYSLDTIVSYYKNNDIRFYLILFGNDPINSKLQYLVNETGGAVIP 

FSSYEGVSKVYDLILEQKTGTYLLEYYYPGPQEPNKYFNLSVEANINQQTGRGEFAYFIN 

t709 . aa 

qgivtnki^aqeefkwalnsynngiyddallsfkkilsfdpnnldyhfwtgnvyyrlgyvee;^ 
pylrhlistieqrrgifsnyelnfkklvkvasldnsiykrphgyqitslradkyggyyaanfvgneilyfdvnnnv 
nalvkdgfsylkspydvieaiqnllyvtlyssdeigvydkvlgvkrksignkgtkdgellapqymaidkrwyiyvse 
wgnkrvskfglegdfilhfgsrtsgykgllgptgvtylneniyvadslrntievfdtsgnhlysvftsiegiegls 

SDFVGIsTNVIVSSKIXSVYKYS lAKKTITKI LKADKMNSKISSS I LDANNQMI VSDFN^ 

VRRIIRLGGPKIYVELWSSKSGLPWGLKSENFSISNENYYIVNPKVAYNVNASKDINIAW 

QIVGLNALMELSKNKNFSFINATSVPIIDNIESLTNSIRNTSSLGPYSTDAVKTDVSLKLAGSGLMSKSSRRAWY 

FSGGILNRKAFEKYSLDTIVSYYKNNDIRFYLILFGNDPINSKLQYLVNETGGAVIPFSSYEGVSKVYDLILEQKT 

GTYLLEYYYPGPQEPNKYFNLSVEANINQQTGRGEFAYFIN 

f709.nt 

ATGTTAATTTTTGGTTTTATTGGTTTGTTTTTTTTAAATATTTTTAGTTTGCATGCCCAAGGAATAGTTACTAATA 
AAGATGCTCAAGAAGAGTTTAAATGGGCTCTTAATTCTTATAATAATGGAATTTACGATGATGCTCTTTTATCTTT 
TAAAAAAATTTTAAGCTTTGATCCTAATAATCTTGATTATCATTTTTGGACTGGCAATGTTTATTATAGACTGGGT 
TATGTTGAAGAAGCTTTAATGGAATGGAGAAATTTAAAAGATCAAGGCTATAAGGTTCCCTATCTTAGACATTTGA 
TTTCTACTATTGAGCAAAGGAGAGGTATTTTTTCAAATTATGAACTTAATTTTAAAAAACTTGTAAAAGTTGCTTC 
TCTTGATAATTCTATTTATAAAAGGCCACATGGGTACCAGATTACATCTTTAAGGGCTGATAAGTACGGCGGATAT 
TACGCTGCTAACTTTGTAGGCAATGAAATATTGTATTTTGATGTTAATAACAATGTTAATGCTTTAGTTAAAGATG 
GCTTTAGTTATTTAAAATCACCTTATGATGTTATTGAAGCTAATAATCTGCTTTATGTGACTCTTTATTCAAGTGA 
TGAAATTGGTGTTTATGACAAAGTTCTTGGAGTTAAAAGGAAATCTATTGGGAATAAAGGCACAAAAGATGGCGAA 
TTGCTTGCTCCTCAGTATATGGCTATTGATAAGAGAAACTATATTTATGTAAGTGAGTGGGGAAATAAAAGAGTAA 
GTAAATTTGGACTTGAAGGTGATTTTATTTTGCATTTTGGTTCTAGAACTTCAGGCTATAAGGGCCTTTTAGGTCC 
CACAGGCGTTACTTATTTGAATGAAAACATTTATGTTGCAGATTCTCTGAGAAATACCATTGAAGTTTTTGATACT 
AGTGGTAATCATTTATATTCAGTTTTTACTTCTATTGAGGGAATAGAGGGGCTTAGCAGTGATTTTGTAGGTAATA 
ATGTTATAGTATCCTCAAAAGATGGTGTTTATAAATATAGCATTGCTAAAAAGACAATTACAAAAATTTTAAAAGC 
AGATAAAATGAATTCTAAAATTTCTTCATCTATTTTGGATGCCAATAATCAGATGATTGTCTCAGATTTTAATAAT 
GCCAAGGTTTCAGTTTACAAGAGTGATGCAAGCCTTTATGATAGTTTAAATGTTGATGTTAGAAGAATAATTAGGC 
TTGGAGGGCCTAAAATTTACGTTGAGCTTAATGTTAGCAGTAAAAGCGGATTACCAGTTGTTGGGCTTAAAAGTGA 

aaatttttcaatttcaaatgaaaattattacattgtcaatcccaaggtggcatataatgtaaatgcttcaai^ 

attaatatagcagttgtttttgataaatcttcttatatgaaaaaatatgatacagatcaaattgtagggttaaatg 

ccctaatggagttgtcaaaaaataaaaactttagttttataaatgcaacaagtgtgcccattatagataatattga 

aagcttaacaaatagcattagaaatacaagttctcttggtccttatagtacagatgctgtaaaaacagacgttagt 

ttgaagttggcaggttctgggcttatgtcaaaaagctcaagaagagcagtagtttattttagtggtggtattttaa 

atcgtaaagcttttgaaaagtactctttagatacaatagtaagctattataaaaataatgatataaggttttactt 

aatactatttggtaatgatcctattaatagtaagcttcagtatttagttaatgaaacaggcggtgctgtaattcct 

ttttcatcttatgaaggtgtatctaaagtttatgatttaattttagaacaaaaaacgggcacttatttgttggaat 

attattatccaggccctcaagaacctaataaatattttaatttatctgttgaagcaaatataaatcaacagacagg 

AAGAGGGGAGTTTGCATATTTTATTAATTAG 
t709 .nt 

CAAGGAATAGTTACTAATAAAGATGCTCAAGAAGAGTTTAAATGGGCTCTTAATTCTTATAATAATGGAATTO 

ATGATGCTCTTTTATCTTTTAAAAAAATTTTAAGCTTTGATCCTAATAATCTTGATTATCATTTTTC 

TGTTTATTATAGACTGGGTTATGTTGAAGAAGCTTTAATGGAATGGAGAAATTTAAAAGATCAA^ 

CCCTATCTTAGACATTTGATTTCTACTATTGAGCAAAGGAGAGGTATTTTTTCAAATTATGAACTTAATTTTAAA^ 

AACTTGTAAAAGTTGCTTCTCTTGATAATTCTATTTATAAAAGGCCACATGGGTACCAGATTACATCTO 

TGATAAGTACGGCGGATATTACGCTGCTAACTTTGTAGGCAATGAAATATTGTATTTTGATGTTAATAACAATC 
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AATGCTTTAGTTAAAGATGGCTTTAGTTATTTAAAATCACCTTATGATGTTATTGAAGCTAATAATCTGCTTTATG 
TGACTCTTTATTCAAGTGATGAAATTGGTGTTTATGACAAAGTTCTTGGAGTTAAAAGGAAATCTATTGGGAATAA 
AGGCACAAAAGATGGCGAATTGCTTGCTCCTCAGTATATGGCTATTGATAAGAGAAACTATATTTATGTAAGTGAG 
TGGGGAAATAAAAGAGTAAGTAAATTTGGACTTGAAGGTGATTTTATTTTGCATTTTGGTTCTAGAACTTCAGGCT 
ATAAGGGCCTTTTAGGTCCCACAGGCGTTACTTATTTGAATGAAAACATTTATGTTGCAGATTCTCTGAGAAATAC 
CATTGAAGTTTTTGATACTAGTGGTAATCATTTATATTCAGTTTTTACTTCTATTGAGGGAATAGAGGGGCTTAGC 
AGTGATTTTGTAGGTAATAATGTTATAGTATCCTCAAAAGATGGTGTTTATAAATATAGCATTGCTAAAAAGACAA 
TTACAAAAATTTTAAAAGCAGATAAAATGAATTCTAAAATTTCTTCATCTATTTTGGATGCCAATAATCAGATGAT 
TGTCTCAGATTTTAATAATGCCAAGGTTTCAGTTTACAAGAGTGATGCAAGCCTTTATGATAGTTTAAATGTTGAT 
GTTAGAAGAATAATTAGGCTTGGAGGGCCTAAAATTTACGTTGAGCTTAATGTTAGCAGTAAAAGCGGATTACCAG 
TTGTTGGGCTTAAAAGTGAAAATTTTTCAATTTCAAATGAAAATTATTACATTGTCAATCCCAAGGTGGCATATAA 
TGTAAATGCTTCAAAAGACATTAATATAGCAGTTGTTTTTGATAAATCTTCTTATATGAAAAAATATGATACAGAT 
CAAATTGTAGGGTTAAATGCCCTAATGGAGTTGTC/^AAAAATAAAAACTTTAGTTTTATAAATGCAACAAGTGTGC 
CCATTATAGATAATATTGAAAGCTTAACAAATAGCATTAGAAATACAAGTTCTCTTGGTCCTTATAGTACAGATGC 
TGTAAAAACAGACGTTAGTTTGAAGTTGGCAGGTTCTGGGCTTATGTCAAAAAGCTCAAGAAGAGCAGTAGTTTAT 
TTTAGTGGTGGTATTTTAAATCGTAAAGCTTTTGAAAAGTACTCTTTAGATACAATAGTAAGCTATTATAAAAATA 
ATGATATAAGGTTTTACTTAATACTATTTGGTAATGATCCTATTAATAGTAAGCTTCAGTATTTAGTTAATGAAAC 
AGGCGGTGCTGTAATTCCTTTTTCATCTTATGAAGGTGTATCTAAAGTTTATGATTTAATTTTAGAACAAAAAACG 
GGCACTTATTTGTTGGAATATTATTATCCAGGCCCTCAAGAACCTAATAAATATTTTAATTTATCTGTTGAAGCAA 
ATATAAATCAACAGACAGGAAGAGGGGAGTTTGCATATTTTATTAATTAG 

f730.aa 

MIKSILDYLLTLHPVLLGLLGSTFTWFTTAFGAAAVFFFRKVDNKIMDAMLGFSAGIMIAASFFSLIQPAIERAEE 
LGYITWVPAVFGFLVGAFFIYIVDVFVPDLDKLTFIDEDLTKHGKKDFLLFTAVTLHNFPEGLAVGVAFGALASNP 
DIQTLVGAMLLTLGIGIQNIPEGAAISLPLRRGNVALAKCFNYGQMSGLVEIVGGUyiGAYAWSFT^ 
AGAMIYVSIEQLIPEAKRKDIDNKVPSIFGVIGFTLMMFLDVSLGZ 

t730 .aa 

AVFFFRKVDNKIMDAMLGFSAGIMIAASFFSLIQPAIERAEELGYITWVPAVFGFLVGAFFIYIVDVFVPDLDKLT 
FIDEDLTKHGKKDFLLFTAVTLHNFPEGLAVGVAFGALASNPDIQTLVGAMLLTLGIGIQNIPEGAAISLPLRRGN 
VALAKCFNYGQMSGLVEIVGGLMGAYAVYSFTRILPFALAFSAGAMIYVSIEQLIPEAKRKDIDNKVPSIFGVIGF 
TLMMFLDVSLGZ 

f730.nt 

ATGATAAAATCAATTTTAGATTATTTATTAACTTTGCATCCTGTATTATTGGGACTTTTAGGTTCTACTTTCACTT 

GGTTTACTACAGCTTTTGGAGCAGCAGCAGTTTTTTTCTTTAGAAAGGTAGATAATAAAATAATGGACGCTATGCT 

TGGTTTTTCAGCTGGCATTATGATAGCGGCCAGTTTTTTTTCGCTTATTCAGCCTGCTATAGAAAGAGCTGAAGAG 

CTTGGATACATTACTTGGGTGCCGGCTGTTTTTGGATTTCTTGTTGGGGCATTTTTTATATATATTG 

TTGTTCCAGATCTGGATAAACTTACTTTTATTGATGAAGACTTAACTAAACATGGTAAAAAAGATTTTTTACTCTT 

TACTGCTGTTACTTTACATAATTTTCCAGAAGGATTGGCTGTTGGAGTTGCTTTTGGAGCCTTGGCGTCTAATCCA 

GATATTCAAACTTTAGTTGGGGCTATGCTTCTTACGCTTGGTATTGGTATTCAAAATATTCCCGAAGGAGCAGCTA 

TTTCTCTGCCTTTAAGAAGAGGTAATGTTGCTTTGGCAAAATGCTTTAACTATGGCCAAATGTCAGGATTGGTAGA 

AATTGTGGGGGGGCTTATGGGTGCTTATGCGGTTTATTCTTTTACTCGAATTTTACCTTTTGCTTTGGCTTTTTCT 

GCAGGAGCTATGATTTATGTGTCAATTGAACAATTAATACCTGAAGCTAAGAGAAAAGACATTGACAATAAAGTGC 

CAAGTATATTTGGTGTTATTGGTTTTACATTAATGATGTTTCTCGATGTTTCACTAGGTTAA 

t730.nt 

GCAGTTTTTTTCTTTAGAAAGGTAGATAATAAAATAATGGACGCTATGCTTGGTTTTTCAGCTGGCATT^ 
CGGCCAGTTTTTTTTCGCTTATTCAGCCTGCTATAGAAAGAGCTGAAGAGCTTGGATACATTACTTC 
TGTTTTTGGATTTCTTGTTGGGGCATTTTTTATATATATTGTAGATGTATTTGTTCCAGATCTC 
TTTATTGATGAAGACTTAACTAAACATGGTAAAAAAGATTTTTTACTCTTTACTGCTC 

CAGAAGGATTGGCTGTTGGAGTTGCTTTTGGAGCCTTGGCGTCTAATCCAGATATTCAAACTTTAGTT<^^ 
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GCTTCTTACGCTTGGTATTGGTATTCAAAATATTCCCGAAGGAGCAGCTATTTCTCTGCCTTTAAGAAGAGGTAAT 

GTTGCTTTGGCAAAATGCTTTAACTATGGCCAAATGTCAGGATTGGTAGAAATTGTGGGGGGGCTTATG^ 

ATGCGGTTTATTCTTTTACTCGAATTTTACCTTTTGCTTTGGCTTTTTCTGCAGGAGCTATGATTTATGTGTCAAT 

TGAACAATTAATACCTGAAGCTAAGAGAAAAGACATTGACAATAAAGTGCCAAGTATATTTGGTGTTATTGGTTTT 

ACATTAATGATGTTTCTCGATGTTTCACTAGGTTAA 



fl97 .aa 

MLLKLKYRFVGFLLLFLIFILLLFSTIFNFVIiCGYLEDYYKQLTRAQVRRAAFSLQSFLDTLHVIINGAASNLALE 
TISEFAMSENRGKDFSESELIDLRKNPKFVIDSVKVSKKYRQYLYNFMANLKNDTLFEEFAFFDFEGRVIVSTRHE 
NNMDFGHSEANTNYFKKAVEDYRQNQLKFIGWYSNLSEGISAEVAIRSKQSEKKAFAIIVPVYSPEDKLVCGYLAG 
YLLNDIVADSFDRFRFGFYKRGNFIYVDPNNIAVNPFEEYNETSRVSSKFLNVLKDVFSKPPFPSNIASEVSVYTI 
DRILLSEMGEDCYYAMIiPISSKLGEKSGVLIARLPYKDIYGVISSLRFQYILYSVLGIIALSIVLSIRIDRIISFR 
I^AIRVLVQDMVKGNLDKDYALDDDENTLDELGMLSLQWKMKKAISVAISSVLRNISYVNKASLEVASSSQNLSS 
SALQQASALEEMSAI^JVEQIASGVNMSANNSYETEQIALKTNENSQIGGRAVEESVIAMQDIVEKVSVIEEIARKTN 
LLALNAAIEAARAGDEGKGFAWASEIRKLADLSKISALEIGELVEDNSKVATEAGVIFKEMLPEIEETANLVKKI 
SEGSSKQSDQIAQFKMALDQVGEWQSSASSSEQLSSMSDKMLEKSKELRKSVLFFKIKDSKIENPENDDYDFRLT 
DCPENSFKDENQNLKSNGISTSNASGHNNYSLDIESESSVRTINKRVDPKKAIDIADKDLNFDDDFSEF 

tl97.aa 

VLCGYLEDYYKQLTRAQWRAAFSLQSFLDTLHVIINGAASNLALETISEFAMSENRGKDFSESELIDLRKNPKFV 

IDSVKVSKKYRQYLYNFMANLKNDTLFEEFAFFDFEGRVIVSTRHENNMDFGHSEANTNYFKKAVEDYRQNQLKFI 

GWYSNLSEGISAEVAIRSKQSEKKAFAIIVPVYSPEDKLVCGYLAGYLLNDIVADSFDRFRFGFYKRGNFIYVDPN 

NIAVNPFEEYNETSRVSSKFIJWLKDVFSKPPFPSNIASEVSVYTIDRILLSEMGEIXrYYAMLPISSKI^EKSGVL 

lARLPYKDIYGVISSLRFQYILYSVLGIIALSIVLSIRIDRIISFRLNAIRVLVQDMVKGNLDKDYALDDDENTLD 

ELGMLSLQVVKMKKAISVAISSVLRNISYWKASLEVASSSQNLSSSALQQASALEEMSANVEQIASGV]S^ 

YETEQIALKTNENSQIGGRAVEESVIAMQDIVEKVSVIEEIARKTNLLALNAAIEAARAGDEGKGFAWASEIRKL 

ADLSKISALEIGELVEDNSKVATEAGVIFKEMLPEIEETANLVKKISEGSSKQSDQIAQFKMALDQVGEWQSSAS 

SSEQLSSMSDKMLEKSKELRKSVLFFKIKDSKIENPENDDYDFRLIDCPENSFKDENQNLKSNGISTSNASGHNNY 

SLDIESESSVRTINKRVDPKKAIDIADKDLNFDDDFSEF 

fl97 .nt 

ATGTTATTGAAGCTTAAATACAGGTTTGTTGGATTTTTATTATTGTTTTTAATTTTTATACTGCTACTTTTTTCCA 

CGATTTTTAATTTTGTTTTATGCGGTTATTTAGAAGATTATTATAAGCAGCTTACAAGGGCGCAAGTAAGAAGAGC 

AGCTTTTTCTTTGCAATCTTTTTTAGACACCCTGCATGTCATAATCAATGGTGCAGCTTCTAATTTGGCACTTGAA 

ACCATATCAGAATTTGCAATGTCTGAGAATAGAGGAAAAGATTTCTCTGAGTCGGAATTGATAGATTTAAGAAAAA 

ATCCAAAATTTGTTATTGACTCTGTAAAGGTGAGCAAAAAATATCGACAATACTTATACAATTTTATGGCCAATCT 

TAAAAATGATACCCTTTTTGAAGAATTCGCTTTTTTTGATTTTGAAGGGAGAGTAATTGTTAGCACAAGACATG 

AATAATATGGATTTTGGTCATTCTGAGGCTAATACCAATTATTTTAAAAAAGCTGTTGAGGATTATAGGCAAAACC 

AATTAAAATTTATAGGTTGGTATTCAAATCTTTCTGAAGGAATATCCGCAGAAGTTGCTATTAGGTCTAAACAAAG 

CGAAAAAAAGGCTTTTGCAATAATTGTACCTGTATATTCCCCAGAAGATAAACTTGTTTGTGGGTATTTGGCCGGA 

TATTTGCTTAATGATATTGTGGCAGATAGTTTTGATAGATTTAGATTCGGTTTTTATAAAAGAGGCAATTTTATTT 

ATGTGGATCCCAACAATATAGCAGTTAATCCTTTTGAAGAATATAATGAAACCAGCAGGGTTAGTTCTAAATTTTT 

GAATGTTCTTAAAGATGTTTTCTCTAAGCCCCCTTTTCCATCAAACATTGCCAGTGAAGTGTCGGTTTACACTATT 

GATAGAATACTTTTGTCCGAAATGGGAGAAGATTGTTATTATGCAATGTTGCCCATAAGTAGTAAATTGGGAGAA^ 

AGAGTGGAGTACTTATTGCTAGGCTTCCTTATAAGGATATTTACGGAGTAATATCTAGTCTAAGATTTCAGTATAT 

TTTATATTCAGTCTTAGGCATTATAGCATTAAGTATTGTTCTTTCAATTAGAATAGACAGGATTATTAGTTTTCGT 

TTAAACGCAATTAGAGTTCTAGTTCAAGATATGGTTAAGGGCAATTTAGATAAAGATTATGCTCTTGATGATGATC 

AAAATACTCTTGATGAACTTGGCATGTTAAGTCTTCAGGTTGTTAAAATGAAAAAAGCTATTTCTG 

TAGTGTTTTGAGAAATATTAGCTATGTAAATAAGGCAAGTTTAGAAGTTGCCAGTTCAAGTCAAAATTTAAGCTCT 

AGTGCATTGCAACAGGCATCTGCTCTTGAAGAAATGTCAGCTAATGTTGAGCAAATAGCCTCAGGTGTCAACATC^ 

GCGCCAATAATTCTTATGAAACAGAACAAATAGCTTTAAAGACGAATGAAAATTCTCAGATAGGTGGTAGGGCCGT 

TGAAGAATCTGTTATTGCTATGCAAGACATTGTGGAGAAAGTTAGTGTTATTGAAGAGATAGCT^ 

TTACTTGCTTTGAATGCGGCTATTGAAGCTGCAAGAGCAGGAGATGAGGGAAAGGGATTTGCTGTTGTGGCCAGTC 
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AGATTAGAAAGTTGGCTGATTTGAGTAAAATTTCTGCTCTTGAGATTGGAGAGTTAGTTGAAGATAACTCTAAGGT 
AGCAACTGAAGCGGGAGTGATCTTTAAAGAAATGCTACCCGAAATTGAAGAAACGGCTAATCTTGTTAAGAAGAT^ 
TCAGAAGGTAGCTCTAAGCAAAGCGATCAGATTGCTCAATTTAAAATGGCTTTAGATCAGGTTGGAGAAGTTGTTC 
AATCTTCAGCTTCAAGCAGTGAGCAGCTTTCTAGTATGTCCGATAAAATGTTAGAAAAGTCTAAGGAACTTAGAAA 
ATCTGTATTATTTTTCAAAATTAAAGATTCTAAAATTGAAAATCCAGAAAATGATGATTATGATTTCAGGTTAATA 
GATTGTCCTGAAAATTCTTTTAAAGATGAAAATCAAAATTTGAAAAGCAATGGAATTTCTACTTCAAATGCCAGTG 
GGCATAATAATTATTCTTTAGATATTGAGAGCGAATCTTCTGTAAGAACTATTAATAAGCGAGTTGATCCTAAAAA 
AGCTATCGATATTGCTGATAAGGATTTAAATTTTGATGATGATTTTTCAGAGTTTTAG 

tl97 .nt 

GTTTTATGCGGTTATTTAGAAGATTATTATAAGCAGCTTACAAGGGCGCAAGTAAGAAGAGCAGCTTTTTCTTTGC 
AATCTTTTTTAGACACCCTGCATGTCATAATCAATGGTGCAGCTTCTAATTTGGCACTTGAAACCATATCAGAATT 
TGCAATGTCTGAGAATAGAGGAAAAGATTTCTCTGAGTCGGAATTGATAGATTTAAGAAAAAATCCAAAATTTGTT 
ATTGACTCTGTAAAGGTGAGCAAAAAATATCGACAATACTTATACAATTTTATGGCCAATCTTAAAAATGATACCC 
TTTTTGAAGAATTCGCTTTTTTTGATTTTGAAGGGAGAGTAATTGTTAGCACAAGACATGAGAATAATATGGATTT 
TGGTCATTCTGAGGCTAATACCAATTATTTTAAAAAAGCTGTTGAGGATTATAGGCAAAACCAATTAAAATTTATA 
GGTTGGTATTCAAATCTTTCTGAAGGAATATCCGCAGAAGTTGCTATTAGGTCTAAACAAAGCGAAAAAAAGGCTT 
TTGCAATAATTGTACCTGTATATTCCCCAGAAGATAAACTTGTTTGTGGGTATTTGGCCGGATATTTGCTTAATGA 
TATTGTGGCAGATAGTTTTGATAGATTTAGATTCGGTTTTTATAAAAGAGGCAATTTTATTTATGTGGATCCCAAC 
AATATAGCAGTTAATCCTTTTGAAGAATATAATGAAACCAGCAGGGTTAGTTCTAAATTTTTGAATGTTCTTAAAG 
ATGTTTTCTCTAAGCCCCCTTTTCCATCAAACATTGCCAGTGAAGTGTCGGTTTACACTATTGATAGAATACTTTT 
GTCCGAAATGGGAGAAGATTGTTATTATGCAATGTTGCCCATAAGTAGTAAATTGGGAGAAAAGAGTGGAGTACTT 
ATTGCTAGGCTTCCTTATAAGGATATTTACGGAGTAATATCTAGTCTAAGATTTCAGTATATTTTATATTCAGTCT 
TAGGCATTATAGCATTAAGTATTGTTCTTTCAATTAGAATAGACAGGATTATTAGTTTTCGTTTAAACGCAATTAG 
AGTTCTAGTTCAAGATATGGTTAAGGGCAATTTAGATAAAGATTATGCTCTTGATGATGATGAAAATACTCTTGAT 
GAACTTGGCATGTTAAGTCTTCAGGTTGTTAAAATGAAAAAAGCTATTTCTGTAGCAATTTCTAGTGTTTTGAGAA 
ATATTAGCTATGTAAATAAGGCAAGTTTAGAAGTTGCCAGTTCAAGTCAAAATTTAAGCTCTAGTGCATTGCAACA 
GGCATCTGCTCTTGAAGAAATGTCAGCTAATGTTGAGCAAATAGCCTCAGGTGTCAACATGAGCGCCAATAATTCT 
TATGAAACAGAACAAATAGCTTTAAAGACGAATGAAAATTCTCAGATAGGTGGTAGGGCCGTTGAAGAATCTGTTA 
TTGCTATGCAAGACATTGTGGAGAAAGTTAGTGTTATTGAAGAGATAGCTAGAAAAACCAATTTACTTGCTTTGAA 
TGCGGCTATTGAAGCTGCAAGAGCAGGAGATGAGGGAAAGGGATTTGCTGTTGTGGCCAGTGAGATTAGAAAGTTG 
GCTGATTTGAGTAAAATTTCTGCTCTTGAGATTGGAGAGTTAGTTGAAGATAACTCTAAGGTAGCAACTGAAGCGG 
GAGTGATCTTTAAAGAAATGCTACCCGAAATTGAAGAAACGGCTAATCTTGTTAAGAAGATTTCAGAAGGTAGCTC 
TAAGCAAAGCGATCAGATTGCTCAATTTAAAATGGCTTTAGATCAGGTTGGAGAAGTTGTTCAATCTTCAGCTTCA 
AGCAGTGAGCAGCTTTCTAGTATGTCCGATAAAATGTTAGAAAAGTCTAAGGAACTTAGAAAATCTGTATTATTTT 
TCAAAATTAAAGATTCTAAAATTGAAAATCCAGAAAATGATGATTATGATTTCAGGTTAATAGATTGTCCTGAAAA 
TTCTTTTAAAGATGAAAATCAAAATTTGAAAAGCAATGGAATTTCTACTTCAAATGCCAGTGGGCATAATAATTAT 
TCTTTAGATATTGAGAGCGAATCTTCTGTAAGAACTATTAATAAGCGAGTTGATCCTAAAAAAGCTATCGATATTG 
CTGATAAGGATTTAAATTTTGATGATGATTTTTCAGAGTTTTAG 

£200. aa 

MTISKWFSKFILKFLNSSAFVSVFALFVGFLIVGLVVMGLGHSPFRMYFIILEIIFSSPKHLGYVLSYSAPL^ 

GLSIGISLKAGLFNIGVEGQFIIiGSIVALIASVLLDLPPILHVITIFIITFLASGSLGILIGYLKAKFNISEVISG 

IMFNWILFHLNNIILDFSFIKRDNSDFSKPIKESAYIDFIASWKLSPEGLAYRSSHPFVNELLK^ 

FAILIWFLLNKTIIGFKINATGSNIEASRCMGIlSr^nCAVLIFSMFLSAAVAGIJlGAIQIJyiGWKAIFKL 

NGIAASLMG]^SPIGIIFSSILFSILLYGSSRVQSIJ4GLPSSIVSLMMGIIVLVISASYFLNKIVLKGVKRVKYNN 

ILD 



t200 .aa 

lvvmgix3hspfrmyfiileiifsspkhlgyvlsysapliftglsigislkaglfnigvt;gqfilgsivalias^ 

dlppilhvitifiitflasgslgiligylkakfnisevisgimf^jwilfhlnniildfsfikrdnsdfskpikesa 

yidfi^swklspegi^yrsshpfvnellkaplhfgiilgiifailiwfli^tiigfkinatgsnieasrcmgin^ 
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KAVLIFSMFLSAAVAGLAGAIQmGVNKAIFKLSYMQGIGFNGIAASLMGNNSPIGIIFSSILFSILLYGSSRVQS 
LMGLPSSIVSLMMGIIVLVISASYFLNKIVLKGVKRVKYNNILD 

f200.nt 

ATGACAATTAGTAAAAACGTATTTAGTAAATTTATTTTGAAATTTTTAAATTCTTCAGCATTTGTTAGTGTATTTG 

CTCTATTTGTTGGATTTTTAATTGTTGGGCTAGTGGTGATGGGGCTTGGTCATTCTCCTTTTAGAATGTATTTTA 

AATATTAGAAATTATTTTTTCTTCTCCCAAACATTTAGGTTATGTTTTAAGTTATTCAGCTCCTTTGATTTTTACA 

GGTCTTTCTATTGGTATTTCTTTAAAAGCGGGTCTTTTTAATATTGGGGTTGAAGGCCAGTTTATACTAGGATCTA 

TTGTTGCTTTAATAGCATCAGTTTTACTTGATTTGCCTCCAATTTTACATGTAATTACTATTTTTATTATTACTTT 

TTTAGCTTCAGGCAGTTTAGGAATTTTAATCGGATATTTAAAAGCCAAATTCAATATTAGCGAAGTGATTTCAGGA 

ATAATGTTTAATTGGATATTATTTCATTTAAATAATATAATTTTAGATTTTAGTTTTATTAA^ 

ATTTTTCAAAACCCATTAAAGAAAGCGCATATATTGATTTTTTAGCTTCTTGGAAGCTCTCACCAGAAGGTCTTGC 

TTATAGATCTTCTCATCCTTTTGTTAATGAGCTTTTAAAAGCACCTCTTCATTTTGGAATAATTTTAGGTATAATT 

TTTGCTATTTTAATATGGTTTTTACTTAATAAAACTATTATTGGATTTAAAATAAATGCCACAGGAAGTAATATTG 

AAGCTTCAAGATGTATGGGTATTAATGTAAAAGCTGTGCTAATTTTTTCAATGTTTCTCTCAGCAGCTGTTGCAGG 

TCTTGCTGGTGCTATTCAACTTATGGGTGTTAATAAAGCTATATTTAAGCTTTCTTATATGCAAGGAATTGGTTTT 

AATGGGATAGCTGCTTCTCTTATGGGAAACAATTCGCCAATTGGCATAATATTTTCTAGCATTCTTTTTTCTATAT 

TGCTTTATGGAAGCAGTAGAGTTCAAAGTTTAATGGGCCTTCCATCTTCAATTGTATCTTTGATGATGGGAATAAT 

TGTTCTTGTAATTTCTGCTAGCTATTTTTTAAATAAAATTGTTTTAAAAGGTGTTAAGCGTGTCAAATATAATAAT 

ATTCTTGATTAG 

t200 .nt 

GGGCTAGTGGTGATGGGGCTTGGTCATTCTCCTTTTAGAATGTATTTTATAATATTAGAAATTATTTTTTCTTCTC 
CCAAACATTTAGGTTATGTTTTAAGTTATTCAGCTCCTTTGATTTTTACAGGTCTTTCTATTGGTATTTCTTTAAA 
AGCGGGTCTTTTTAATATTGGGGTTGAAGGCCAGTTTATACTAGGATCTATTGTTGCTTTAATAGCATCAGTTTTA 
CTTGATTTGCCTCCAATTTTACATGTAATTACTATTTTTATTATTACTTTTTTAGCTTCAGGCAGTTTAGGAATTT 
TAATCGGATATTTAAAAGCCAAATTCAATATTAGCGAAGTGATTTCAGGAATAATGTTTAATTGGATATTATTTCA 
TTTAAATAATATAATTTTAGATTTTAGTTTTATTAAAAGAGATAATAGTGATTTTTCAAAACCCATTAAAGAAAGC 
GCATATATTGATTTTTTAGCTTCTTGGAAGCTCTCACCAGAAGGTCTTGCTTATAGATCTTCTCATCCTTTTGTTA 
ATGAGCTTTTAAAAGCACCTCTTCATTTTGGAATAATTTTAGGTATAATTTTTGCTATTTTAATATGGTTTTTACT 
TAATAAAACTATTATTGGATTTAAAATAAATGCCACAGGAAGTAATATTGAAGCTTCAAGATGTATGGGTATTAAT 
GTAAAAGCTGTGCTAATTTTTTCAATGTTTCTCTCAGCAGCTGTTGCAGGTCTTGCTGGTGCTATTCAACTTATGG 
GTGTTAATAAAGCTATATTTAAGCTTTCTTATATGCAAGGAATTGGTTTTAATGGGATAGCTGCTTCTCTTATGGG 
AAACAATTCGCCAATTGGCATAATATTTTCTAGCATTCTTTTTTCTATATTGCTTTATGGAAGCAGTAGAGTTCAA 
AGTTTAATGGGCCTTCCATCTTCAATTGTATCTTTGATGATGGGAATAATTGTTCTTGTAATTTCTGCTAGCTATT 
TTTTAAATAAAATTGTTTTAAAAGGTGTTAAGCGTGTCAAATATAATAATATTCTTGATTAG 

f208 .aa 

MVKKFSIFLKAIIIFSIFELLIEELSIILFLPYKIRFALIFLGFLFDTIFIFIFLYKITKAYLSQRLEIYVRNNLF 
FDIIHCLIPLAFYSSYQLKNIIVAHETILNPIMLSLFKLRFLRLLRFNDLIIEIYYNSKEKNLILIAFARTFSMSL 
LIPFTFFIIISSSKIVNSIPEKQEFNIIKNISIINEKAYIKEKYPFILIIKEKDDIIYSKSDEIFVYYSPSEYRVI 
EMEKTKFYIDKYLQRKSDSILGIFLFTLFASFTIFLl^FYKFFKASFLNPIILMTKILQDPLEYRKIQIPFTLSEE 
KVYELAKSFNNLLLKEKLNSKRKSKIPLEIEKVKKIINKNQEIK 

t208 .aa 

IIIFSIFELLIEELSIILFLPYKIRFALIFLGFLFDTIFIFIFLYKITKAYLSQRLEIYVRNNLFFDIIHCLIPLA 
FYSSYQLKNIIVAHETILNPIMLSLFKLRFLRLLRFNDLIIEIYYNSKEKNLILIAFARTFSMSLLIPFTFFIIIS 
SSKIVNSIPEKQEFNIIKNISIINEKAYIKEKYPFILIIKEKDDIIYSKSDEIFVYYSPSEYRVIEMEKTKFYIDK 
YLQRKSDSIIiGIFLFTLFASFTIFLMNFYKFFKASFLNPIII^TKILQDPLEYRKIQIPFTLSEEKVYEIJVKSFK^ 
LLLKEKLNSKRKSKIPLEIEKVKKIINKNQEIK 



f208.nt 
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ATGGTAAAAAAATTTTCAATTTTCTTAAAAGCAATAATAATTTTTTCAATATTTGAACTTTTAATCGAAGAACTCT 

CAATAATTCTTTTTTTACCATACAAAATACGATTTGCACTAATATTTCTTGGGTTTCTATTTGACACAATTTTTAT 

TTTCATTTTTTTATACAAAATAACCAAGGCCTACCTTTCCCAAAGATTAGAAATCTACGTCAGAAACAATCTATTC 

TTCGATATAATCCACTGCCTTATTCCTTTAGCGTTTTATAGCTCATATCAGCTTAAAAACATAATTGTCGCCCATG 

AAACAATATTAAATCCAATAATGCTATCACTTTTCAAGTTAAGATTTTTAAGACTTCTTAGGTTTAATGACCTAAT 

AATAGAAATATATTACAATTCAAAAGAAAAGAACCTAATACTAATAGCATTTGCTAGGACATTTTCAATGAGCTTA 

TTAATACCATTTACATTTTTTATAATAATATCAAGCTCAAAAATTGTAAATTCAATACCAGAAAAACAAGAATTTA 

ATATCATTAAAAATATATCAATAATAAATGAAAAAGCTTACATTAAAGAAAAATATCCCTTCATCTTAATAATCAA 

GGAAAAAGATGACATAATATACTCAAAATCAGACGAAATATTTGTTTACTACAGTCCCAGTGAATATAGAGTAATA 

GAAATGGAGAAAACAAAATTTTATATAGATAAATATTTGCAAAGAAAAAGCGATTCTATTCTTGGAATTTTTCTAT 

TTACATTGTTTGCATCATTTACTATTTTTTTAATGAATTTTTATAAATTTTTTAAAGCAAGCTTTTTAAATCCTAT 

TATTTTAATGACAAAAATTTTACAAGACCCATTAGAATATCGAAAAATTCAAATTCCTTTTACTTTAAGCGAAGAA 

AAAGTATATGAACTTGCAAAATCATTTAACAATCTCTTGCTAAAAGAAAAACTAAACTCAAAGCGAAAAA^ 

TACCTTTAGAAATTGAAAAAGTAAAAAAAATAATT/^TAAAAACCAGGAAATAAAATGA 

t208 .nt 

ATAATAATTTTTTCAATATTTGAACTTTTAATCGAAGAACTCTCAATAATTCTTTTTTTACCATACAAAATACGAT 

TTGCACTAATATTTCTTGGGTTTCTATTTGACACAATTTTTATTTTCATTTTTTTATACAAAATAACCAAGGCCTA 

CCTTTCCCAAAGATTAGAAATCTACGTCAGAAACAATCTATTCTTCGATATAATCCACTGCCTTATTCCTTTAGCG 

TTTTATAGCTCATATCAGCTTAAAAACATAATTGTCGCCCATGAAACAATATTAAATCCAATAATGCTATCACTTT 

TCAAGTTAAGATTTTTAAGACTTCTTAGGTTTAATGACCTAATAATAGAAATATATTACAATTCAAAAGAAAAGAA 

CCTAATACTAATAGCATTTGCTAGGACATTTTCAATGAGCTTATTAATACCATTTACATTTTTTATAATAATATCA 

AGCTCAAAAATTGTAAATTCAATACCAGAAAAACAAGAATTTAATATCATTAAAAATATATCAATAATAAATGAAA 

AAGCTTACATTAAAGAAAAATATCCCTTCATCTTAATAATCAAGGAAAAAGATGACATAATATACTCAAAATCAGA 

CGAAATATTTGTTTACTACAGTCCCAGTGAATATAGAGTAATAGAAATGGAGAAAACAAAATTTTATATAGATAAA 

TATTTGCAAAGAAAAAGCGATTCTATTCTTGGAATTTTTCTATTTACATTGTTTGCATCATTTACTATTTTTTTAA 

TGAATTTTTATAAATTTTTTAAAGCAAGCTTTTTAAATCCTATTATTTTAATGACAAAAATTTTACAAGACCCATT 

AGAATATCGAAAAATTCAAATTCCTTTTACTTTAAGCGAAGAAAAAGTATATGAACTTGCAAAATCATTTAACAAT 

CTCTTGCTAAAAGAAAAACTAAACTCAAAGCGAAAAAGCAAAATACCTTTAGAAATTGAAAAAGTAAAAAAAA 

TTAATAAAAACCAGGAAATAAAATGA 



f210.aa 

I^IQIIIMLLJ^LLDFPIJSrARLLDISIEKRADEEIKKYSSYNLILEKEyYTNFPTSEIEKNIYKLTEHFVKSIMLNK 

TNYSLLNSNYKEANKYLIQSELIDKKFLKYKIFKIKJSriNGIFKSHSLIYTKKGFYKLELYIENNAEPLKIFNLNIT 

YFLKNLDKISNEMIFFPREKREVlS^y[IQKTTIAADSSSKPRGI^n"DTGIPFNVLIVDDSVFTVKQLTQIFTSE^ 

IiyrAADGEEAVIKYKNHYPNIDIVTLDITMPKMDGITCLSNIMEFDKNARVIMISALGKEQLV^ 

KPLDRAKVLQRVMSVFVK 



t210 .aa 

RLLDISIEKRADEEIKKYSSYNLILEKEYYTNFPTSEIEKNIYKLTEHFVTCSIMLNKTNYSLIiNSKPi^K^ 
SELIDKKFLKYKIFKIKNINGIFKSHSLIYTKKGFYKLELYIENNAEPLKIFNLNITYFLKNLDKISNEMIFFPRE 
KREVNMIQKTTIAADSSSKPRGINYOTGI PFNVLIVDDSVFTVKQLTQIFTSEGFNI IDTAAIXSEEA^^ 
NIDIVTLDITMPKMDGITCLSNIMEFDKNARVIMISALGKEQLVKDCLIKGAKTFIVKPLDRAKVLQRVMSV 

f210.nt 

ATGAAAATTCAAATAATTATAATGCTGCTTGCATTGTTAGATTTTCCACTTAATGCCAGACTTTTGGACATTTCAA 

TTGAAAAAAGAGCAGATGAAGAAATAAAAAAATATTCGTCTTATAATTTAATTTTAGAAAAAGAATACTATACCAA 

TTTTCCAACAAGCGAAATAGAAAAAAATATTTATAAACTAACAGAACATTTTGTAAAAAGCATAATGCTC;^ 

ACTAACTACAGCTTATTAAATTCAAACTACAAAGAAGCAAATAAATATCTAATTCAAAGCGAACTCATTGATAAAA 

AATTTTTAAAATATAAAATATTTAAAATCAAAAATATAAATGGAATTTTTAAAAGCCATTCACTAATAT^ 
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AAAAGGATTTTACAAATTAGAACTTTACATAGAAAATAATGCAGAACCTCTAAAAATATTTAACCTTAACATTACT 
TATTTTTTAAAGAATTTAGATAAAATAAGTAATGAAATGATTTTTTTCCCAAGGGAATGA 

t210.nt 

AGACTTTTGGACATTTCAATTGAAAAAAGAGCAGATGAAGAAATAAAAAAATATTCGTCTTATAATTTAATTTTAG 
AAAAAGAATACTATACCAATTTTCCAACAAGCGAAATAGAAAAAAATATTTATAAACTAACAGAACATTTTGTAAA 
AAGCATAATGCTCAATAAAACTAACTACAGCTTATTAAATTCAAACTACAAAGAAGCAAATAAATATCTAATTCAA 
AGCGAACTCATTGATAAAAAATTTTTAAAATATAAAATATTTAAAATCAAAAATATAAATGGAATTTTTAAAAGCC 
ATTCACTAATATATACAAAAAAAGGATTTTACAAATTAGAACTTTACATAGAAAATAATGCAGAACCTCTAAAAAT 
ATTTAACCTTAACATTACTTATTTTTTAAAGAATTTAGATAAAATAAGTAATGAAATGATTTTTTTCCCAAGGGAA 
TGA 

f22 .aa 

MLKTLTKIITISCLIVGCASLPYTPPKQNIJSre'LMELLPGANLYAHVNLIKKRSIYNSLSPKYKSVI^ 
KKENNDFALLIMGNFPKDIFWGIHKNRNTESIGNIFTNPKWKLKNSNIYIIPNKARTSIAITQKDITAKDNNML^ 
KYIGEIEKNEMFFWIQDPTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVL 
TNMTNLTISSHIKTTIKDQNTVEIEFNIQKSSVESLIEKLASNIQT 

t22 .aa 

PYTPPKQNLI^LMELLPGANLYAHVNLIKimSIYNSLSPKYKSVLGLISNLYFSYKKENNDFALLIMGNFPKDIFW 
GIHKNRNTESIGNIFTNPKWKLKNSNIYIIPNKARTSIAITQKDITAKDNNMLTTKYIGEIEKNEMFFWIQ 
LPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVLTNMTNLTISSHIKTTIKDQNT 
VEIEFNIQKSSVESLIEKLASNIQT 

f22 .nt 

ATGTTAAAAACATTAACAAAAATAATTACCATTTCATGCCTCATAGTGGGATGCGCAAGCCTGCCTTACACTCCTC 
CAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTATACGCCCATGTAAATTTAATTAAAAA 
CAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCTTATAAGCAATTTATACTTTAGCTAT 
AAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAAGATATTTTCTGGGGAATTCATAAAA 
ATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAACTTAAAAATTCAAATATATACATTAT 
TCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGCAAAAGACAATAATATGCTAACAACA 
AAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGATCCAACATTATTGCTCCCAAACCAAA 
TAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAAACAGCTTAAATCAAGAAGAATATAT 
TTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTCAAAAAAGTTAATTCCAACCGTCTTG 
ACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACGGTTGAAATAGAAT 
TTAATATTCAAAAATCTAGTGTTGAAAGCCTTATAGAAAAACTAGCTTCAAATATTCAAACCTAA 

t22.nt 

CCTTACACTCCTCCAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTATACGCCCATGTAA 

ATTTAATTAAAAACAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCTTATAAGCAATTT 

ATACTTTAGCTATAAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAAGATATTTTCTGG 

GGAATTCATAAAAATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAACTTAAA 

ATATATACATTATTCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGCAAAAGACAATAA 

TATGCTAACAACAAAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGATCCAACATT^ 

CTCCCAAACCAAATAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAAACAGCTTAAATC 

AAGAAGAATATATTTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTCAAAAAAGTTAAT 

TCCAACCGTCTTGACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACG 

GTTGAAATAGAATTTAATATTCAAAAATCTAGTGTTGAAAGCCTTATAGAAAAACTAGCTTCAAATATTCAAACCT 

AA 



£221 .aa 
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MGITVFYLFSIFASFVLGSSMDSVKENVLKSTIFYYDVEEVEFPYARKQTLQFIAKTHLKYAVFNFDKNKMFSYTF 
VFDKKLISQYAIFIEVKKKFGEATLVTPLNYLWDLGDSIIVLNKNILRITLKSYISNYNK 

t221.aa 

SI^SVKEWLKSTIFYYDVEEVEFPYARKQTLQFIAKTHLKYAVFNFDKNKMFSYTFVFDKKLISQYAIFIEVKKK 
FGEATLVTPLNYLWDLGDSIIVLNKNILRITLKSYISNYNK 

f221.nt 

ATGGGTATTACAGTTTTTTATTTATTTTCTATTTTTGCATCTTTTGTTCTGGGTTCTAGCATGGATTCTGTTAAAG 
AGAATGTTCTCAAGAGCACTATTTTTTATTATGATGTTGAAGAAGTTGAATTTCCTTATGCTAGGAAGCAGACTTT 
ACAATTTATTGCTAAAACCCATTTAAAATATGCTGTTTTTAATTTTGACAAAAATAAAATGTTTTCGTACACTTTT 
GTTTTTGATAAAAAATTAATATCTCAGTATGCAATTTTTATTGAGGTAAAGAAAAAGTTTGGCGAGGCTACACTAG 
TAACGCCTTTGAATTATTTATGGGATCTTGGTGATTCTATTATTGTTTTAAATAAAAATATTTTAAGAATTACTTT 
AAAATCTTATATTTCAAATTATAATAAATGA 

t221.nt 

AGCATGGATTCTGTTAAAGAGAATGTTCTCAAGAGCACTATTTTTTATTATGATGTTGAAGAAGTTGAATTTCCTT 
ATGCTAGGAAGCAGACTTTACAATTTATTGCTAAAACCCATTTAAAATATGCTGTTTTTAATTTTGACAAAAATAA 
AATGTTTTCGTACACTTTTGTTTTTGATAAAAAATTAATATCTCAGTATGCAATTTTTATTGAGGTAAAGAAAAAG 
TTTGGCGAGGCTACACTAGTAACGCCTTTGAATTATTTATGGGATCTTGGTGATTCTATTATTGTTTTAAATAAAA 
ATATTTTAAGAATTACTTTAAAATCTTATATTTCAAATTATAATAAATGA 

f253.aa 

MYMENIEVRGQPNFFGLIPFFVFIIIYLGTGIYLGVIGVEMAFYQLPASVAMFFASIVCFLVFKGKFSDKIHIFIK 

GAAQYDIILMCLIFMLSGAFSSLCKEIGCVETVANLGIKYINPNWIVSGIFFVTCFLSFSAGTSVGSIVAIAPIAF 

NIAVKSGINPNLIAASVMCGAMFGDNLSLISDTTIVSSRTQGSSILDVFISSSFYAFPSAILTFFSFFFLSENLSN 

ATNFLHESSIDLVKTVPYLMIIFFSIAGMNVFIVLFLGILSICLISVLYGNLYFLDVMKNINKGFLNMADL^ 

LTGGVSFAVIHNGGFKWLLIKLKSLIRGKSSAEFSIGAFVSIVDVFLAJSfNTIAILICGKVAKKIAFEN^ 

SILDMFSCIFQGIIPYGAQMIILVNFSNGLVSPISILPFLVYFGFLLFFVILSILGLDIKKVFLFFLKK 

t253 .aa 

LVFKGKFSDKIHIFIKGAAQYDIILMCLIFMLSGAFSSLCKEIGCVETVANLGIKYINPNWIVSGIFFVTCFLSFS 
AGTSVGSIVAIAPIAFNIAVKSGINPWLIAASVMCGAMFGDNLSLISDTTIVSSRTQGSSILDVFISSSFYAFPSA 
ILTFFSFFFLSENLSNATNFLHESSIDLVKTVPYIJ^IIFFSLAGMNVFIVLFLGILSICLISVLYGNLYFLDVMI^ 
INKGFLNMADLIFLSILTGGVSFAVIHNGGFKWLL.IKLKSLIRGKSSAEFSIGAFVSIVDVFLANNTIAILICGKV 
AKKIAFENNISVQRSASILDMFSCIFQGIIPYGAQMIILVNFSNGLVSPISILPFLVYFGFLLFFVILSILGLDIK 
KVFLFFLKK 

f253 .nt 

ATGTATATGGAAAATATTGAAGTAAGAGGGCAGCCAAATTTTTTTGGGCTTATTCCTTTTTTTGTTTTTATT^ 

TCTATTTAGGCACGGGGATTTATITGGGAGTTATTGGTGTAGAAATGGCCTTTTATCAACTGCCGGCTAGTGTTG 

AATGTTTTTTGCTTCCATTGTTTGTTTTTTGGTATTTAAAGGAAAATTTTCCGACAAAATTCACATATTTATTAAA 

GGAGCAGCTCAGTACGATATTATACTAATGTGTCTTATTTTTATGCTTTCGGGAGCTTTCTCTTCTCTTTGTAAAG 

AAATAGGCTGCGTTGAAACTGTAGCAAATTTGGGAATTAAATATATTAATCCTAATTGGATTGTTTCTG^ 

TTTTGTAACCTGCTTTCTTTCTTTTTCTGCCGGCACTTCTGTTGGATCTATCGTTGCAATTGCTCCTATTGCTTTT 

AATATTGCTGTTAAAAGCGGCATTAATCCGAATTTAATAGCAGCATCTGTAATGTGTGGAGCTATGTTTGGAGATA 

ATCTTTCTTTAATATCAGATACAACTATTGTTTCTAGTCGAACTCAAGGTAGTAGCATCTTAGATGTTT 

TAGCAGTTTTTATGCTTTTCCATCCGCCATACTAACTTTTTTTTCTTTTTTCTTTCTTTCTGAAAATTTC 

GCCACAAACTTTTTACACGAAAGTTCAATAGATTTAGTGAAAACTGTGCCTTATTT^ 
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TAGCTGGAATGAATGTTTTTATAGTTCTTTTTTTAGGTATTCTTTCTATATGTCTTATTAGCGTTTTGTATGGTAA 
TTTATACTTTCTAGATGTAATGAAAAACATTAATAAAGGGTTTTTAAATATGGCGGATTTGATTTTTCTTTCAATT 
TTAACAGGGGGAGTTTCTTTTGCCGTGATTCATAATGGAGGCTTTAAATGGCTACTTATTAAATTAAAATCCTTGA 
TTAGAGGAAAAAGTTCAGCGGAATTTTCTATTGGGGCTTTTGTTTCAATAGTTGATGTTTTTCTTGCTAATAACAC 
AATTGCCATACTTATTTGCGGCAAAGTAGCAAAAAAGATAGCTTTTGAAAATAACATCAGTGTTCAAAGAAGTGCT 
TCTATTTTAGATATGTTCTCTTGTATTTTTCAAGGCATTATTCCTTATGGTGCGCAAATGATTATTTTAGTGAATT 
TTTCAAATGGACTTGTGTCGCCAATTAGTATTTTGCCATTTTTAGTTTATTTTGGATTTTTATTGTTTTTTGTTAT 
TTTATCTATTTTGGGCCTTGATATAAAAAAAGTTTTTTTATTTTTTTTAAAAAAATAA 

t253 .nt 

TTGGTATTTAAAGGAAAATTTTCCGACAAAATTCACATATTTATTAAAGGAGCAGCTCAGTACGATATTATACTAA 
TGTGTCTTATTTTTATGCTTTCGGGAGCTTTCTCTTCTCTTTGTAAAGAAATAGGCTGCGTTGAAACTGTAGCAAA 
TTTGGGAATTAAATATATTAATCCTAATTGGATTGTTTCTGGTATATTTTTTGTAACCTGCTTTCTTTCTTTTTCT 
GCCGGCACTTCTGTTGGATCTATCGTTGCAATTGCTCCTATTGCTTTTAATATTGCTGTTAAAAGCGGCATTAATC 
CGAATTTAATAGCAGCATCTGTAATGTGTGGAGCTATGTTTGGAGATAATCTTTCTTTAATATCAGATACAACTAT 
TGTTTCTAGTCGAACTCAAGGTAGTAGCATCTTAGATGTTTTTATTAGTAGCAGTTTTTATGCTTTTCCATCCGCC 
ATACTAACTTTTTTTTCTTTTTTCTTTCTTTCTGAAAATTTGTCCAATGCCACAAACTTTTTACACGAAAGTTCAA 
TAGATTTAGTGAAAACTGTGCCTTATTTAATGATTATATTTTTCTCTTTAGCTGGAATGAATGTTTTTATAGTTCT 
TTTTTTAGGTATTCTTTCTATATGTCTTATTAGCGTTTTGTATGGTAATTTATACTTTCTAGATGTAATGAAAAAC 
ATTAATAAAGGGTTTTTAAATATGGCGGATTTGATTTTTCTTTCAATTTTAACAGGGGGAGTTTCTTTTGCCGTGA 
TTCATAATGGAGGCTTTAAATGGCTACTTATTAAATTAAAATCCTTGATTAGAGGAAAAAGTTCAGCGGAATTTTC 
TATTGGGGCTTTTGTTTCAATAGTTGATGTTTTTCTTGCTAATAACACAATTGCCATACTTATTTGCGGCAAAGTA 
GCAAAAAAGATAGCTTTTGAAAATAACATCAGTGTTCA7VAGAAGTGCTTCTATTTTAGATATGTTCTCTTGTATTT 
TTCAAGGCATTATTCCTTATGGTGCGCAAATGATTATTTTAGTGAATTTTTCAAATGGACTTGTGTCGCCAATTAG 
TATTTTGCCATTTTTAGTTTATTTTGGATTTTTATTGTTTTTTGTTATTTTATCTATTTTGGGCCTTGATATAAAA 
AAAGTTTTTTTATTTTTTTTAAAAAAATAA 

£265. aa 

mKCFVSLSLLLIFFACSSWEIELNDDISGIVSIFVNVNREFEKIRKELLTTLVGEEIANMPLFPVDEIKKYFI^ 

GEEKLGLKLLSIKTQGDSINLWKFDNLIKILGDYMKKPDISVFKIEKKDGKNIIELNINLENATKNINENKEYIS 

DAIiAALLPSDEIPMSAKEYKDVLVYFLSDFTSKASELIDNSKLNLWKTSRWQEQFGFKQINSNTLRFEMDlW 

LSLETPIKLRLV 

Y 

t265 .aa 

SNVEIELNDDISGIVSIFVNVNREFEKIRKELLTTLVGEEIANMPLFPVDEIKKYFKNGEEKLGLKLLSIKTQGDS 
INLVVKFDNLIKILGDYMKKPDISVFKIEKKDGKNIIELNINLENATKNINENKEYISDALAALLPSDEIPMSAKE 
YKDVLVYFLSDFTSKASELIDNSKLNLWKTSRNVQEQFGFKQINSNTLRFEMDMVKGLSLETPIKLRLVY 

f265.nt 

ATGAGAAAGTGTTTTGTTAGCTTGAGTTTATTGTTGATTTTTTTTGCTTGTAGCTCTAATGTTGAAATTGAGTTAA 

ATGATGATATTAGTGGTATTGTTTCAATATTTGTTAATGTTAATAGAGAATTTGAAAAAATTAGAAAAGAACTCTT 

AACAACTTTGGTGGGAGAAGAAATTGCAAATATGCCTCTTTTTCCTGTAGATGAAATAAAAAAATACTTTAAAAAT 

GGAGAGGAAAAGCTTGGGCTTAAGCTTTTGAGTATTAAAACCCAAGGAGATTCTATTAATTTAGTTGTTAAGTTTG 

ATAATTTAATTAAAATTTTAGGCGATTATATGAAAAAACCCGATATATCTGTGTTTAAGATAGAAAAAAAAGATGG 

TAAAAATATTATTGAACTTAATATTAATTTGGAAAACGCTACTAAGAATATTAATGAAAATAAAGAATATATTAGT 

GATGCACTTGCTGCTCTTTTGCCATCGGATGAGATCCCAATGTCTGCCAAAGAATATAAAGATGTTTTGGTTTATT 

TTTTATCGGATTTTACTTCCAAAGCAAGTGAACTTATTGACAATTCCAAACTTAATCTTGTAGTTAAGACTTCTAG 

AAATGTTCAAGAACAATTTGGATTCAAACAAATTAACTCAAACACACTGCGGTTTGAGATGGATATGGTTA^ 

TTAAGTCTTGAAACACCAATAAAACTTAGATTAGTTTATTGA 



t265.nt 
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TCTAATGTTGAAATTGAGTTAAATGATGATATTAGTGGTATTGTTTCAATATTTGTTAATGTTAATAGAGAATTTG 

AAAAAATTAGAAAAGAACTCTTAACAACTTTGGTGGGAGAAGAAATTGCAAATATGCCTCTTTTTCCTGTAGATGA 

AATAAAAAAATACTTTAAAAATGGAGAGGAAAAGCTTGGGCTTAAGCTTTTGAGTATTAAAACCCAAGGAGATTCT 

ATTAATTTAGTTGTTAAGTTTGATAATTTAATTAAAATTTTAGGCGATTATATGAAAAAACCCGATATATCTGTGT 

TTAAGATAGAAAAAAAAGATGGTAAAAATATTATTGAACTTAATATTAATTTGGAAAACGCTACTAAGAAT^ 

TGAAAATAAAGAATATATTAGTGATGCACTTGCTGCTCTTTTGCCATCGGATGAGATCCCAATGTCTGCCAAAGAA 

TATAAAGATGTTTTGGTTTATTTTTTATCGGATTTTACTTCCAAAGCAAGTGAACTTATTGACAATTCCAAACTTA 

ATCTTGTAGTTAAGACTTCTAGAAATGTTCAAGAACAATTTGGATTCAAACAAATTAACTCAAACACACTGCGGTT 

TGAGATGGATATGGTTAAAGGATTAAGTCTTGAAACACCAATAAAACTTAGATTAGTTTATTGA 

f269,aa 

MNIRKLLFCIFFMNISFLLFAGDYKGLDFKIKFFNQSIYRVNSNVFIEVSLSNASESVLTLEIGDINSFGFDFDVT 

DTTNIKVKRPIEYVKKRSKNVAIPVRNMSLRPNEKFSWINLNQFVKFSKIXSVYFVKGIFFPDISDPSKK^ 

TLFLNDGFDENPGSIDLVNLSENNDIQDILKKKKLSPDEIVKYLLKALQLGKKEKFFLYLDIEGLLLNDKGKAYLY 

KQKLSPIPNKNWEEYKEYLWNSNNSDISKAPNKFSIIETTYSDTSGKVIADLYFDDGQFYISKRYTFFFKKYDYY 

WIIYDYIVQNTGIKEK 

t269 .aa 

GDYKGLDFKIKFFNQSIYRVNSWFIEVSLSNASESVLTLEIGDINSFGFDFDVTDTTNIKVKRPIEYVKKRSKNV 
AIPVRNMSLRPNEKFSWINLNQFVKFSKDGVYFVKGIFFPDISDPSKKKESNIITLFLNDGFDENPGSIDLVNLS 
ENNDIQDILKKKKLSPDEIVKYLLKALQLGKKEKFFLYLDIEGLLLNDKGKAYLYKQKLSPIPNKNWEEYKEYLW 
NSNNSDISKAPNKFSIIETTYSDTSGKVIADLYFDDGQFYISKRYTFFFKKYDYYWIIYDYIVQNTGIKEK 

£269. nt 

ATGAATATTAGAAAATTGCTTTTTTGTATCTTTTTTATGAATATTTCTTTTCTTTTGTTTGCGGGAGATTACAAGG 

GCCTTGATTTTAAAATCAAGTTTTTTAATCAATCTATTTATCGTGTCAATAGTAATGTTTTTATTGAAGTTTCTCT 

TAGTAATGCGTCTGAGAGTGTTTTAACTTTAGAAATAGGCGATATTAATTCTTTTGGCTTTGATTTTGATGTTACT 

GATACCACCAATATTAAAGTTAAAAGACCTATTGAATATGTTAAAAAGAGATCTAAAAATGTTGCAATTCCTGTTA 

GAAATATGAGCTTGAGACCTAATGAAAAATTTTCTGTAGTTATTAACTTAAATCAATTTGTTAAGTTTAGTAAAGA 

TGGAGTTTATTTTGTTAAGGGTATTTTTTTCCCAGACATTTCAGATCCATCTAAGAAAAAAGAATCCAATATTATT 

ACGCTTTTTTTGAATGATGGTTTTGATGAAAATCCAGGTAGCATAGACCTTGTTAATTTGTCTGAAAATAATGATA 

TTCAAGATATCTTGAAAAAGAAAAAATTATCTCCCGATGAAATTGTTAAATATTTGTTAAAGGCATTGCAGCTTGG 

GAAAAAAGAAAAGTTCTTTTTATATCTTGATATTGAAGGTTTGTTATTAAATGACAAGGGCAAGGCATACCTTTAT 

AAGCAAAAGTTATCACCTATTCCCAATAAAAATGTAGTTGAAGAGTATAAAGAATATTTGTGGAATTCTAATAATT 

CGGATATTTCAAAAGCACCAAATAAATTTTCTATTATTGAAACTACTTATTCTGATACTTCTGGCAAGGTGATTG 

TGATTTATATTTTGACGATGGGCAATTTTATATTTCCAAAAGATATACTTTCTTCTTTAAAAAATATGATTATTAT 

TGGATAATTTATGATTACATTGTTCAAAATACTGGCATTAAGGAAAAGTAA 

t269 .nt 

GGAGATTACAAGGGCCTTGATTTTAAAATCAAGTTTTTTAATCAATCTATTTATCGTGTCAATAGTAATGTTTTTA 

TTGAAGTTTCTCTTAGTAATGCGTCTGAGAGTGTTTTAACTTTAGAAATAGGCGATATTAATTCTTTTGGCTTTGA 

TTTTGATGTTACTGATACCACCAATATTAAAGTTAAAAGACCTATTGAATATGTTAAAAAGAGATCTAAAAATGTT 

GCAATTCCTGTTAGAAATATGAGCTTGAGACCTAATGAAAAATTTTCTGTAGTTATTAACTTAAATCAATTTC 

AGTTTAGTAAAGATGGAGTTTATTTTGTTAAGGGTATTTTTTTCCCAGACATTTCAGATCCATCTAAGAAAAAAGA 

ATCCAATATTATTACGCTTTTTTTGAATGATGGTTTTGATGAAAATCCAGGTAGCATAGACCTTGTTAATTTGTCT 

GAAAATAATGATATTCAAGATATCTTGAAAAAGAAAAAATTATCTCCCGATGAAATTGTTAAATATTTGTTAAAGG 

CATTGCAGCTTGGGAAAAAAGAAAAGTTCTTTTTATATCTTGATATTGAAGGTTTGTTATTAAATGACA^ 

GGCATACCTTTATAAGCAAAAGTTATCACCTATTCCCAATAAAAATGTAGTTGAAGAGTATAAAGAATATTTGTGG 

AATTCTAATAATTCGGATATTTCAAAAGCACCAAATAAATTTTCTATTATTGAAACTACTTATTCTGATAC 

GCAAGGTGATTGCTGATTTATATTTTGACGATGGGCAATTTTATAT^ 

ATATGATTATTATTGGATAATTTATGATTACATTGTTCAAAATACTGGCATTAAGGAAAAGTAA 
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f29.aa 

MNWLSFFYVLLFLLIFPFELQSNNKENIENLIKLHMLYDLTNNLSKELETINKIKNFDLEQHYLLITKYYLKIKKY 
KEANDFLKKINQKKIKNQKIKNEIISLKLRINEDNINEEEIKKILNNEKNIDVKIIYQIFSLIKFKNKKI^KIK^ 
I ILTNYPKS I YSYKIKRNE 



t2 9.aa 

I^KENIENLIKLHMLYDLTNNLSKELETINKIKNFDLEQHYLLITKYYLKIKKYKEANDFLKKINQKKIKNQKIKN 
EIISLKLRINEDNINEEEIKKIIJSrNEKNIDVTCIIYQIFSLIKFKNKKUyjJKIKNIILTl^ 

f29.nt 

ATGAACTGGCTATCCTTTTTTTATGTTTTATTATTTTTATTAATTTTTCCTTTTGAATTACAGAGTAATAATAAAG 

AAAATATAGAAAATTTAATAAAGCTACATATGCTTTATGATTTAACCAATAACCTGTCAAAAGAATTAGAAACAAT 

AAATAAAATTAAAAATTTTGACTTAGAACAACATTATCTGCTAATTACAAAATATTATCTAAAAATAAAAAAATAT 

AAAGAAGCTAATGATTTTTTAAAAAAAATAAACCAAAAAAAGATCAAAAATCAAAAAATAAAAAACGAAATC^ 

CGCTAAAATTAAGAATAAATGAAGATAATATTAATGAAGAAGAAATCAAAAAAATTTTAAATAACGAAAAAAATA 

AGATGTCAAAATAATTTATCAAATATTCAGTCTTATAAAATTTAAAAATAAAAAATTAGCAAATAAAATTAAAAAC 

ATAATACTAACAAACTATCCCAAAAGCATTTATTCTTATAAAATAAAAAGAAATGAATAA 

t29 .nt 

AATAATAAAGAAAATATAGAAAATTTAATAAAGCTACATATGCTTTATGATTTAACCAATAACCTGTCAAAAGAAT 
TAGAAACAATAAATAAAATTAAAAATTTTGACTTAGAACAACATTATCTGCTAATTACAAAATATTATCTAAAAAT 
AAAAAAATATAAAGAAGCTAATGATTTTTTAAAT^AAAATAAACCAAAAAAAGATCAAAAATCAAAAAATAAAAAAC 
GAAATCATTTCGCTAAAATTAAGAATAAATGAAGATAATATTAATGAAGAAGAAATCAAAAAAATTTTAAATAACG 
AAAAAAATATAGATGTCAAAATAATTTATCAAATATTCAGTCTTATAAAATTTAAAAATAAAAAATTAGCAAATAA 
AATTAAAAACATAATACTAACAAACTATCCCAAAAGCATTTATTCTTATAAAATAAAAAGAAATGAATAA 

f290.aa 

MNSIYVIGKLLLTLFLIFFPFCYNLFAVNLAEINKLSEYAKSIVLIDFDTKRILYSKKPNLVFPPASLTKIVTIYT 

ALIEAEKRNIKLKSIVPISDSASYYNAPPNSSLMFLEKGQIVNFEEILKGLSVSSGNDSSIAIAEFWGNLNSFVN 

LMNIlWLNLGLFNMHFVEPSGYSSENKITALDMAFFVKSYIEKFKFMLNIHSLKYFIYPKSRNLGTALSSKFm 

QRNANLLIYDYPYSrX^IKTGYIKESGmLVATAKKGERRLIAWLGVEKGINGFGEKMRSSIAKNLFEYGFNKYSK 

FPLIVKLKEKVYNGTVDTVALFSKEPFYYILTKDEFDKINISYTVDKLVAPLSGDMPVGRAMIFLENEKIGDVALF 

SGICVKRLGFWQGLYKSFINLFSREY 

t290 .aa 

VNI^EINKLSEYAKSIVLIDFDTKRILYSKKPNLVFPPASLTKIVTIYTALIEAEKRNIKLKSIVPISDSASYYNA 

PPNSSlJyiFLEKGQIWFEEILKGLSVSSGNDSSIAIAEFWGNLNSFWLMNIlWIiNLGLFNlffiFVEPSGYSS^ 

ITALDMAFFVKSYIEKFKFMLNIHSLKYFIYPKSRNLGTALSSKFLNLKQRNANLLIYDYPYSDGIKTGYIKESGL 

NLVATAKKGERRLIAVVLGVEKGINGFGEKMRSSIAiCNLFEYGFNKYSKFPLIVKLKEKVYNGTVI^ 

YYILTKDEFDKINISYTVDKLVAPLSGDMPVGRAMIFLENEKIGDVALFSGKVKRLGFWQGLYKSFINLFSREY 

f290.nt 

atgaatagtatctatgttattgggaaattgttattaactttatttttaatttttttcccgttttgttat;^ 

TTGCAGTTAATTTAGCTGAGATTAATAAATTATGAGAGTATGCAAAGTCAATAGTTTTAATAGATTTTC 

gcgaatactttattctaagaagcccaatttggtttttcctccagcatctcttacaaagattgttacaa 
gctttaattgaagctgaaaagcgaaatataaaattaaaaagcatagttcctattagcgattctgcttcatattata 

ATGCACCCCCCAATTCTTCTTTGATGTTTTTAGAAAAAGGTCAAATTGTTAATTTTGAAGAGATT^ 
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TTCAGTTTCTTCGGGTAATGATTCTTCTATTGCAATTGCTGAGTTTGTAGTAGGCAATTTAAATAGCTTTGTTAAT 

TTAATGAATATTAATGTTTTAAATTTAGGGCTTTTTAATATGCATTTTGTTGAACCTTCTGGATATAGCAGCGAGA 

ATAAGATTACAGCACTAGATATGGCTTTTTTTGTGAAATCTTATATAGAAAAGTTTAAATTTATGCTTAATATTCA 

TTCTTTAAAGTATTTTATTTATCCAAAGAGTAGAAATTTAGGAACTGCTTTGTCATCAAAATTTTTAAACTTAAAA 

CAAAGAAATGCTAATTTATTAATATATGATTACCCTTATTCAGATGGCATTAAAACGGGATATATTAAGGAATCAG 

GCTTAAATCTTGTTGCTACTGCTAAAAAGGGTGAGAGAAGATTAATAGCAGTTGTATTGGGGGTTGAAAAAGGAAT 

TAATGGATTTGGAGAGAAAATGAGATCTTCGATTGCAAAAAATTTATTTGAATATGGATTTAATAAATATTCTAAA 

TTTCCTTTAATAGTAAAATTAAAAGAAAAAGTCTATAATGGTACAGTGGATACAGTTGCTCTTTTTTCTAAAGAGC 

CTTTTTATTATATTTTAACTAAAGATGAATTTGATAAAATTAATATAAGTTATACTGTTGATAAATTGGTTC 

ACTTAGTGGGGATATGCCTGTTGGGAGGGCTATGATTTTTTTAGAAAATGAAAAAATAGGGGATGTTGCT^ 

AGTGGCAAGGTAAAAAGATTAGGGTTTTGGCAAGGTCTTTATAAGAGTTTTATAAATCTTTTTTCAAGAGAGTATT 

AA 



t290 .nt 

GTTAATTTAGCTGAGATTAATAAATTATCAGAGTATGCAAAGTCAATAGTTTTAATAGATTTTGATACTAAGCGAA 
TACTTTATTCTAAGAAGCCCAATTTGGTTTTTCCTCCAGCATCTCTTACAAAGATTGTTACAATTTATACAGCTTT 
AATTGAAGCTGAAAAGCGAAATATAAAATTAAAAAGCATAGTTCCTATTAGCGATTCTGCTTCATATTATAATGCA 
CCCCCCAATTCTTCTTTGATGTTTTTAGAAAAAGGTCAAATTGTTAATTTTGAAGAGATTTTAAAAGGACTTTCAG 
TTTCTTCGGGTAATGATTCTTCTATTGCAATTGCTGAGTTTGTAGTAGGCAATTTAAATAGCTTTGTTAATTTT^T 
GAATATTAATGTTTTAAATTTAGGGCTTTTTAATATGCATTTTGTTGAACCTTCTGGATATAGCAGCGAGAATAAG 
ATTACAGCACTAGATATGGCTTTTTTTGTGAAATCTTATATAGAAAAGTTTAAATTTATGCTTAATATTCATTCTT 
TAAAGTATTTTATTTATCCAAAGAGTAGAAATTTAGGAACTGCTTTGTCATCAAAATTTTTAAACTTAAAACAAAG 
AAATGCTAATTTATTAATATATGATTACCCTTATTCAGATGGCATTAAAACGGGATATATTAAGGAATCAGGCTTA 
AATCTTGTTGCTACTGCTAAAAAGGGTGAGAGAAGATTAATAGCAGTTGTATTGGGGGTTGAAAAAGGAATTAATG 
GATTTGGAGAGAAAATGAGATCTTCGATTGCAAAAAATTTATTTGAATATGGATTTAATAAATATTCTAAATTTCC 
TTTAATAGTAAAATTAAAAGAAAAAGTCTATAATGGTACAGTGGATACAGTTGCTCTTTTTTCTAAAGAGCCTTTT 
TATTATATTTTAACTAAAGATGAATTTGATAAAATTAATATAAGTTATACTGTTGATAAATTGGTTGCTCCACTTA 
GTGGGGATATGCCTGTTGGGAGGGCTATGATTTTTTTAGAAAATGAAAAAATAGGGGATGTTGCTTTGTTTAGTGG 
CAAGGTAAAAAGATTAGGGTTTTGGCAAGGTCTTTATAAGAGTTTTATAAATCTTTTTTCAAGAGAGTATTAA 

f291.aa 

l^SYDFITALVPIILIIIGLGIIKKPAYYVIPISLIATVAIVIFYKNLGIVNTSIJ^LEGALMGIWPIATVIIAAI 

FTYKMSEDQKDIETIKNILSNVSSDRRIIVLLVAWGFGNFLEGVAGYGTAVAIPVSILIAMGFEPFFACLICLIMN 

TSSTAYGSVGIPITSLAQATNLDVNIVSSEIAFQLILPTLTIPFVLVILTGGGIKGLKGVFLLTLLSGMSMAISQV 

FISKTLGPELPAILGSILSMTITIVYARFFGNKETTERQSKNTISLSKGIIACSPYILIVTFIVLVSPLFNKIHEY 

LKTFQSTISIYPEANPLHFKWIISPGFLIILATTISYSIRGVPMLKQLKIFTLTLKJCMALSSFIIICIVAISRLJyiT 

HSGMIRDLANGISIITGKFGPLFSPLIGAIGTFLTGSDTVSNVLFGPLQTQMAENIGANPYWIJ\AA^ 

ISPQNITIATTTAGLIGQEGKLLSKTIIYALYYILATGLLVYLV 

t291 .aa 

QKDIETIKNILSNVSSDRRIIVLLVAWGFGNFLEGVAGYGTAVAIPVSILIAMGFEPFFACLICLIMNTSSTAYGS 

VGIPITSLAQATNLDVNIVSSEIAFQLILPTLTIPFVLVILTGGGIKGLKGVFLLTLLSGMSMAISQVFISKTLGP 

ELPAILGSILSMTITIVYARFFGNKETTERQSKNTISLSKGIIACSPYILIVTFIVLVSPLFNKIHEYLKTFQSTI 

SIYPEANPLHFKWIISPGFLIILATTISYSIRGVPMLKQLKIFTLTLKKMALSSFIIICIVAISRLMTHSGMIRDL 

ANGISIITGKFGPLFSPLIGAIGTFLTGSDTVSNVLFGPLQTQMAENIGANPYWIJUVANTTGATGG^ 

ATTTAGLIGQEGKLLSKTI I YALYYILATGLLVYLV 

f291.nt 

ATGAATTCTTATGATTTTATAACAGCTTTGGTACCAATAATCCTAATAATTATTGGACTTGGCATAATAA^^ 

CAGCTTACTATGTAATACCCATATCATTAATAGCCACCGTTGCTATAGTTATATTTTATAAAAACTTC^ 

AAACACAAGTCTTGCAATGCTTGAGGGCGCCTTAATGGGGATATGGCCAATAGCAACTGTAATTAT^ 
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TTTACATACAAAATGTCAGAAGATCAAAAAGATATAGAAACTATTAAAAATATTTTATCAAACGTATCTTCTGATA 

GAAGAATTATAGTATTACTAGTTGCATGGGGATTTGGAAATTTTTTAGAAGGAGTTGCTGGATATGGAACTGCTGT 

TGCAATTCCTGTATCAATATTAATAGCAATGGGATTTGAACCATTTTTTGCCTGCTTAATCTGTTTAATAA 

ACCTCATCAACCGCCTACGGATCTGTGGGAATCCCTATAACATCTTTAGCTCAAGCAACTAACTTGGATGTTAACA 

TTGTTTCATCTGAGATTGCATTCCAACTAATACTTCCAACCTTAACAATACCTTTTGTACTGGTAATTCTTACAGG 

AGGGGGCATTAAAGGATTAAAAGGAGTATTCCTTCTTACCTTACTCTCAGGAATGTCAATGGCAATATCTCAAGTA 

TTTATATCAAAAACTTTGGGTCCAGAACTTCCTGCAATCCTTGGAAGCATTCTTTCTATGACAATAACAATAGTTT 

ATGCAAGGTTTTTTGGAAATAAAGAAACTACTGAGCGCCAAAGCAAAAACACAATATCCTTATCAAAAGGAATTAT 

TGCCTGCTCACCCTACATTTTAATAGTAACTTTTATAGTGCTTGTATCTCCTCTTTTTAACAAAATTCATGAATAC 

CTAAAAACTTTTCAAAGCACTATTAGCATTTATCCAGAAGCAAATCCCTTACACTTTAAATGGATTATCTCTCCGG 

GCTTCTTGATTATACTTGCAACAACAATATCCTATTCAATACGGGGAGTTCCAATGTTAAAACAGCTAAAAATATT 

TACATTAACCTTGAAAAAAATGGCATTATCTTCCTTTATAATCATATGCATTGTTGCAATATCAAGATTAATGACA 

CATAGTGGAATGATAAGAGATCTTGCTT^TGGAATCTCAATAATAACAGGTAAATTTGGACCATTATTTAGCCCAC 

TAATTGGAGCTATTGGGACATTTTTAACAGGAAGTGATACGGTTTCAAATGTTCTTTTTGGACCTTTACAAACACA 

AATGGCAGAAAATATTGGAGCAAATCCTTACTGGCTTGCAGCAGCAAATACAACAGGAGCAACTGGAGGGAAAATG 

ATTTCTCCCCAAAACATCACAATAGCAACAACAACTGCTGGATTAATTGGACAAGAAGGCAAGCTTTTATCAAAAA 

CAATAATTTATGCTTTATACTACATTTTAGCAACAGGATTGCTAGTTTATTTAGTATAA 



t291 .nt 

CAAAAAGATATAGAAACTATTAAAAATATTTTATCAAACGTATCTTCTGATAGAAGAATTATAGTATTACTAGTTG 
CATGGGGATTTGGAAATTTTTTAGAAGGAGTTGCTGGATATGGAACTGCTGTTGCAATTCCTGTATCAATATTAAT 
AGCAATGGGATTTGAACCATTTTTTGCCTGCTTAATCTGTTTAATAATGAACACCTCATCAACCGCCTACGGATCT 
GTGGGAATCCCTATAACATCTTTAGCTCAAGCAACTAACTTGGATGTTAACATTGTTTCATCTGAGATTGCATTCC 
AACTAATACTTCCAACCTTAACAATACCTTTTGTACTGGTAATTCTTACAGGAGGGGGCATTAAAGGATTAAAAGG 
AGTATTCCTTCTTACCTTACTCTCAGGAATGTCAATGGCAATATCTCAAGTATTTATATCAAAAACTTTGGGTCCA 
GAACTTCCTGCAATCCTTGGAAGCATTCTTTCTATGACAATAACAATAGTTTATGCAAGGTTTTTTGGAAATAAAG 
AAACTACTGAGCGCCAAAGCAAAAACACAATATCCTTATCAAAAGGAATTATTGCCTGCTCACCCTACATTTTAAT 
AGTAACTTTTATAGTGCTTGTATCTCCTCTTTTTAACAAAATTCATGAATACCTAAAAACTTTTCAAAGCACTATT 
AGCATTTATCCAGAAGCAAATCCCTTACACTTTAAATGGATTATCTCTCCGGGCTTCTTGATTATACTTGCAACAA 
CAATATCCTATTCAATACGGGGAGTTCCAATGTTAAAACAGCTAAAAATATTTACATTAACCTTGAAAAAAATGGC 
ATTATCTTCCTTTATAATCATATGCATTGTTGCAATATCAAGATTAATGACACATAGTGGAATGATAAGAGATCTT 
GCTAATGGAATCTCAATAATAACAGGTAAATTTGGACCATTATTTAGCCCACTAATTGGAGCTATTGGGACATTTT 
TAACAGGAAGTGATACGGTTTCAAATGTTCTTTTTGGACCTTTACAAACACAAATGGCAGAAAATATTGGAGCAAA 
TCCTTACTGGCTTGCAGCAGCAAATACAACAGGAGCAACTGGAGGGAAAATGATTCTCCCCAAAACATCACAATAG 
CAACAACAACTGCTGGATTAATTGGACAAG 



f296 .aa 

MPSPIRVFFLVLLFIFIFNPVLIAMLFILFPFILILFSFLGVFRIYFTRDYSYSRSREFEFYKLSFLLMAKLLSIL 
GTVTGEQLNYWFIINSLNLSERGKSELYTIFHSAITKNNNADKILYTLKLGYFQHm 

KNLEAEKFISYVGVFLELESIX5YEAYKDINIKIWPYSVLGLTYSASDDEVKKAYKSLVIKYHPDKFANDPVRQKD 
ANDKFIKIQDAYEKICKERNIR 

t296 .aa 

lYFTRDYSYSRSREFEFYKLSFLLMAKLLSILGTVTGEQLNYVNFIINSLNLSERGKSELYTIFHSAITKN^ 

ILYTLKLGYFQHKDLFIWLFATLKEINRLSRYKNLEAEKFISYVGVFLELESDGYEAYKDINIKIVNPYSVLGLTY 

SASDDEVKKAYKSLVIKYHPDKFANDPVRQKDANDKFIKIQDAYEKICKERNIR 

f296.nt 

ATGCCAAGCCCAATTAGAGTGTTTTTTTTAGTGTTGTTGTTTATTTTTATTTT 

TTTTTATTTTATTTCCTTTTATTTTGATATTATTTAGTTTTTTAGGTGTTTTTAGAATATACTTTAC;^ 
CTCATATTCTAGATCTAGAGAGTTTGAATTTTATAAACTTTCTTTTTTATT 
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GGAACTGTAACTGGGGAGCAGCTAAATTATGTCAATTTTATTATCAATTCTTTGAATTTGTCTGAACGTGGTAAAT 

CAGAATTGTATACCATTTTTCATTCTGCTATTACTAAAAATAATAATGCTGATAAAATTTTATATACCCTTAAGCT 

TGGTTATTTTCAGCACAAAGATCTTTTTATATGGCTTTTTGCCACTCTTAAAGAAATTAACAC^ 

AAAAATTTAGAAGCTGAAAAATTTATTTCTTATGTTGGTGTTTTTTTAGAACTTGAATCTGATGGTTATGAAGCTT 

ATAAAGATATTAATATTAAAATTGTAAATCCTTATAGTGTTTTGGGGTTAACATATAGTGCTAGCGATGATGAGGT 

TAAAAAGGCGTATAAAAGCCTTGTTATAAAATATCATCCTGATAAGTTTGCAAATGATCCTGTAAGACAAA^ 

GCAAATGATAAATTTATAAAAATTCAAGATGCTTATGAAAAAATTTGCAAGGAAAGAAATATAAGGTAA 

t296.nt 

ATATACTTTACAAGGGATTACTCATATTCTAGATCTAGAGAGTTTGAATTTTATAAACTTTCTTTTTTATTAATGG 
CTAAATTGCTATCTATTTTAGGAACTGTAACTGGGGAGCAGCTAAATTATGTCAATTTTATTATCAATTCTTTGAA 
TTTGTCTGAACGTGGTAAATCAGAATTGTATACCATTTTTCATTCTGCTATTACTAAAAATAATAATGCTGATAAA 
ATTTTATATACCCTTAAGCTTGGTTATTTTCAGCACAAAGATCTTTTTATATGGCTTTTTGCCACTCTTAAAGAAA 
TTAACAGGCTTTCTAGGTATAAAAATTTAGAAGCTGAAAAATTTATTTCTTATGTTGGTGTTTTTTTAGAACTTGA 
ATCTGATGGTTATGAAGCTTATAAAGATATTAATATTAAAATTGTAAATCCTTATAGTGTTTTGGGGTTAACATAT 
AGTGCTAGCGATGATGAGGTTAAAAAGGCGTATAAAAGCCTTGTTATAAAATATCATCCTGATAAGTTTGCAAATG 
ATCCTGTAAGACAAAAAGATGCAAATGATAAATTTATAAAAATTCAAGATGCTTATGAAAAAATTTGCAAGGAAAG 
AAATATAAGGTAA 

f3 .aa 

MKKKNLSIYMIMLISLLSCNTSDPNELTRKKMQDKNVKILGFLEKIQADNKEIVEKHIEKKEKQWQAASVAPim/ 

ESNFPYYLQEEIEIKEEELVPNTDEEKKAEKAISDGSLEFAKLVDDENKLKNESAQLESSFlSnwyKEILELADLIQ 

AEVHVAGRINSYIKKRKTTKEKEYKKREIKNKIEKQALIKLFNQLLEKRGDIENLHTQLNSGLSERASAKYFFEKA 

KETLKAAITERLNNKRKNRPWWARRTHSNLAIQAKNEAEDALNQLSTSSFRILEAMKIKEDVKQLLEEV^ 

KSKIFSSGDRLYDFLETSK 

t3 . aa 

NELTRKKMQDKWKILGFLEKIQADNKEIVEKHIEKKEKQMVQAASVAPINVESNFPYYLQEEIEIKEEELVPNTD 
EEKKAEKAISDGSLEFAKLVDDENKLKNESAQLESSFN1WYKEILEIJU:^LIQAEVHVAGRINSYIKKRKTTKEK 
KKREIKNKIEKQALIKLFNQLLEKRGDIENLHTQLNSGLSERASAKYFFEKAKETLKAAITERL]SnSIKRKNRPWW 
RTHSNIiAIQAKNEAEDALNQLSTSSFRILEAMKIKEDVKQLLEEVKSFLiDSSKSKIFSSGDRLYDFLETSK 

f3.nt 

ATGAAAAAAAAAAATTTATCAATTTACATGATAATGCTAATAAGTTTATTATCATGTAATACAAGTGACCCCAATG 
AATTAACTCGTAAAAAAATGCAAGACAAGAACGTGAAAATTTTAGGATTTTTAGAGAAAATTCAAGCAGATAATAA 
AGAAATTGTTGAAAAACATATAGAAAAAAAAGAAAAACAAATGGTGCAGGCTGCTTCTGTAGCACCTATTAATGTA 
GAGAGTAATTTCCCATATTATCTTCAAGAAGAAATAGAGATAAAAGAAGAAGAGTTGGTTCCAAATACTGATGAAG 
AAAAGAAGGCAGAGAAGGCAATTAGCGATGGGAGTCTTGAATTTGCTAAATTAGTTGATGATGAAAATAAACTTAA 
AAATGAATCTGCGCAATTAGAATCTAGTTTTAATAATGTTTATAAAGAAATCTTAGAACTTGCAGATTTAATACAA 
GCAGAGGTGCATGTTGCAGGAAGGATAAATAGCTATATAAAAAAAAGAAAGACCACTAAAGAAAAAGAATATAAGA 
AGAGAGAAATTAAGAATAAGATAGAAAAACAGGCTCTAATTAAGTTGTTCAATCAGTTATTAGAAAAAAGAGGCGA 
TATTGAAAATCTTCATACTCAATTAAATAGTGGACTTAGCGAGAGAGCATCTGCAAAATACTTTTTTGAGAAAGCC 
AAAGAAACTTTAAAAGCTGCTATTACTGAAAGATTAAATAACAAACGTAAAAATCGGCCATGGTGGGCAAGAAGAA 
CACATAGTAATTTAGCAATACAGGCAAAAAATGAGGCAGAGGATGCTTTAAACCAATTAAGTACTTCTTCTTTTAG 
GATACTTGAAGCAATGAAAATAAAGGAAGATGTAAAACAGCTTCTTGAAGAAGTAAAATCTTTTCTAGATTCTTCA 
AAGAGCAAAATCTTTTCTAGTGGCGATAGATTATATGATTTTTTAGAGACGAGTAAATAA 

t3 .nt 

AATGAATTAACTCGTAAAAAAATGCAAGACAAGAACGTGAAAATTTTAGGATTTTTAGAGAA 
ATAAAGAAATTGTTGAAAAACATATAGAAAAAAAAGAAAAACAAATGGTGCAGGCTGCTTCTC 

TGTAGAGAGTAATTTCCCATATTATCTTCAAGAAGAAATAGAGATAAAAGAAGAAGAGTTGGTTCCAAATACTGAT 
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GAAGAAAAGAAGGCAGAGAAGGCAATTAGCGATGGGAGTCTTGAATTTGCTAAATTAGTTGATGATGAAAATAAAC 
TTAAAAATGAATCTGCGCAATTAGAATCTAGTTTTAATAATGTTTATAAAGAAATCTTAGAACTTGCAGATTTAAT 
ACAAGCAGAGGTGCATGTTGCAGGAAGGATAAATAGCTATATAAAAAAAAGAAAGACCACTAAAGAAAAAGAATAT 
AAGAAGAGAGAAATTAAGAATAAGATAGAAAAACAGGCTCTAATTAAGTTGTTCAATCAGTTATTAGAAAAAAGAG 
GCGATATTGAAAATCTTCATACTCAATTAAATAGTGGACTTAGCGAGAGAGCATCTGCAAAATACTTTTTTGAGAA 
AGCCAAAGAAACTTTAAAAGCTGCTATTACTGAAAGATTAAATAACAAACGTAAAAATCGGCCATGGTGGGCAAGA 
AGAACACATAGTAATTTAGCAATACAGGCAAAAAATGAGGCAGAGGATGCTTTAAACCAATTAAGTACTTCTTCTT 
TTAGGATACTTGAAGCAATGAAAATAAAGGAAGATGTAAAACAGCTTCTTGAAGAAGTAAAATCTTTTCTAGATTC 
TTCAAAGAGCAAAATCTTTTCTAGTGGCGATAGATTATATGATTTTTTAGAGACGAGTAAATAA 

f30.aa 

Ml^KILTLLVLILSISSVLMLSKSITKKSKYKIIRDYFINSNWLVKIENKDLKFTISKPIYDKKLN^ 
SHFLISNNVDIAINTSPYEVKQNMFFPKGLYIYNKKMISKQINNYGEIVIKHNKIILNPKEDEIENCD 
LIKNGKYKKNFKETRHPRTIIGTDKl^KHLFLVTIEGRGVNNSKGASLNEAIDFALSYGMTNAINL^ 
KSNNAPYKLNFTANIFGQERPVPFHLGIKLPN 

t3 0.aa 

LSKSITKKSKYKIIRDYFINSNYVLVKIENKDLKFTISKPIYDKKLNNYFFKGQTTSHFLISNl^rV'DIAINTSPY 
KQNMFFPKGLYIYNKKMISKQINNYGEIVIKHNKIILNPKEDEIENCDYGFSGFFVLIKNGKYKKNFKETRHPRTI 
IGTDKNNKHLFLVTIEGRGVNNSKGASLNEAIDFALSYGMTNAINLDGGGSSTLWKSNNAPYKLNFTANIFGQER 
PVPFHLGIKLPN 

fSO.nt 

ATGAATAAAAAAATATTAACACTGCTAGTATTGATTTTAAGTATTTCATCAGTACTAATGCTGTCCAAATCAATCA 
CCAAAAAATCCAAATACAAAATTATTAGGGATTATTTCATAAACAGCAATTATGTTCTGGTGAAAATTGAAAATAA 
AGATCTAAAATTTACCATATCAAAACCTATTTACGACAAAAAGCTAAATAATTACTTCTTTAAAGGCCAAACAACA 
AGCCATTTCTTAATTTCTAACAATGTTGACATTGCAATTAACACAAGTCCATACGAAGTTAAACAAAACATGTTTT 
TCCCAAAAGGACTATACATATATAATAAAAAAATGATTTCAAAACAAATAAATAACTACGGAGAGATTGTAATAAA 
GCACAACAAAATTATATTAAATCCCAAGGAAGACGAAATAGAAAACTGCGATTATGGATTTAGCGGATTTTTTGTT 
TTAATCAAAAACGGAAAGTATAAAAAAAATTTTAAAGAAACAAGGCACCCAAGAACAATAATAGGAACTGATAAAA 
ATAACAAGCATTTATTTCTTGTTACAATAGAAGGAAGGGGTGTCAATAATAGCAAAGGGGCCTCTCTTAATGAAGC 
TATTGATTTTGCATTAAGCTACGGCATGACTAACGCTATTAATCTAGACGGGGGGGGCTCAAGCACTCTTGTTGTA 
AAATCAAATAACGCTCCTTACAAATTAAACTTCACAGCAAACATCTTTGGACAGGAAAGACCTGTCCCATTTCATT 
TAGGAATAAAACTTCCTAATTGA 

t3 0.nt 

CTGTCCAAATCAATCACCAAAAAATCCAAATACAAAATTATTAGGGATTATTTCATAAACAGCAATTATGTTCTGG 
TGAAAATTGAAAATAAAGATCTAAAATTTACCATATCAAAACCTATTTACGACAAAAAGCTAAATAATTACTTCTT 
TAAAGGCCAAACAACAAGCCATTTCTTAATTTCTAACAATGTTGACATTGCAATTAACACAAGTCCATACGAAGTT 
AAACAAAACATGTTTTTCCCAAAAGGACTATACATATATAATAAAAAAATGATTTCAAAACAAATAAATAACTACG 
GAGAGATTGTAATAAAGCACAACAAAATTATATTAAATCCCAAGGAAGACGAAATAGAAAACTGCGATTATGGATT 
TAGCGGATTTTTTGTTTTAATCAAAAACGGAAAGTATAAAAAAAATTTTAAAGAAACAAGGCACCCAAGAACAATA 
ATAGGAACTGATAAAAATAACAAGCATTTATTTCTTGTTACAATAGAAGGAAGGGGTGTCAATAATAGCAAAGGGG 
CCTCTCTTAATGAAGCTATTGATTTTGCATTAAGCTACGGCATGACTAACGCTATTAATCTAGACGGGGGGGGCTC 
AAGCACTCTTGTTGTAAAATCAAATAACGCTCCTTACAAATTAAACTTCACAGCAAACATCTTTGGACAGGAAAGA 
CCTGTCCCATTTCATTTAGGAATAAAACTTCCTAATTGA 

f308.aa 

MQLLKNKYPFKRALLDLFLVYAIWIiASPFVNVNSEFVmVDENHFYFWISRSFLillFII 
IPKFKFIFLWDSVLIFIKTILIAMIVIFLIAFLLEYLLPESVLVYYFQ^^SrAGFNWKISSKKAFFIJ^ 
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EELFYRAFVITKFTQMGFPWATAILSSMFFAYGHLYYGILGFLVTFILGIFFAFTYLRYKNVYYVIFIH 
VSSLLLFLN 

t308 .aa 

NSEFWNVDENHFYFWISRSFLIIFIIYFFKLTSSYDDFRVEFFIPKFKFIFLWDSVLIFIKTILIAMIVIFLIAFL 
LEYLLPESVLVYYFQNNAGFNWKISSKKAFFLMTFTSFFTGAFEELFYRAFVITKFTQMGFPWATAILSSMFFAY 
GHLYYGILGFLVTFILGIFFAFTYLRYKNVYYVIFIHSFYNIIVSSLLLFIiN 

f308 .nt 

ATGCAATTGTTAAAAAATAAATATCCATTCAAGCGGGCTTTGCTTGATCTTTTTTTGGTCTATGCTATTGTTTATT 
TGGCATCTCCTTTTGTAAATGTTAATTCAGAATTTTGGAATGTTGATGAAAATCATTTTTATTTTTGGATTTCAAG 
ATCTTTTTTAATTATTTTTATAATTTATTTTTTTAAACTTACCAGTTCTTATGATGATTTTAGAGTAGAGTTTTTT 
ATTCCTAAATTTAAATTTATTTTTCTTTGGGATTCTGTTTTAATTTTTATTAAAACAATATTGATTGCAATGATAG 
TCATTTTTTTAATAGCTTTTTTGCTTGAATATTTGTTGCCAGAATCGGTACTTGTCTATTATTTTCAAAACAATGC 
TGGATTTAATTGGAAGATTAGCAGTAAAAAAGCATTTTTTTTAATGACTTTTACCTCTTTTTTTACAGGAGCTTTT 
GAAGAACTTTTTTACAGGGCTTTTGTTATTACTAAGTTTACACAAATGGGATTTCCTGTTGTAGCTACCGCCATTC 
TTAGTAGTATGTTTTTTGCTTATGGGCATTTATATTATGGAATTTTAGGATTTTTGGTTACATTTATATTAGGGAT 
ATTTTTTGCTTTTACTTATTTAAGGTATAAAAATGTATATTATGTGATTTTTATACATAGTTTTTATAATATTATT 
GTTAGCAGCTTGTTGCTTTTTTTGAATTAA 

t308 -nt 

AATTCAGAATTTTGGAATGTTGATGAAAATCATTTTTATTTTTGGATTTCAAGATCTTTTTTAATTATTTTTATAA 
TTTATTTTTTTAAACTTACCAGTTCTTATGATGATTTTAGAGTAGAGTTTTTTATTCCTAAATTTAAATTTATTTT 
TCTTTGGGATTCTGTTTTAATTTTTATTAAAACAATATTGATTGCAATGATAGTCATTTTTTTAATAGCTTTTTTG 
CTTGAATATTTGTTGCCAGAATCGGTACTTGTCTATTATTTTCAAAACAATGCTGGATTTAATTGGAAGATTAGCA 
GTAAAAAAGCATTTTTTTTAATGACTTTTACCTCTTTTTTTACAGGAGCTTTTGAAGAACTTTTTTACAGGGCTTT 
TGTTATTACTAAGTTTACACAAATGGGATTTCCTGTTGTAGCTACCGCCATTCTTAGTAGTATGTTTTTTGCTTAT 
GGGCATTTATATTATGGAATTTTAGGATTTTTGGTTACATTTATATTAGGGATATTTTTTGCTTTTACTTATTTAA 
GGTATAAAAATGTATATTATGTGATTTTTATACATAGTTTTTATAATATTATTGTTAGCAGCTTGTTGCTTTTTTT 
GAATTAA 

fSl.aa 

MKKYLFFILFLISSl^LIVSYPLSFGGGFSYQFTlsry'TDKTGATKFAPNFTRADHGINLNLFFDANYVLFE^ 

FWTHNGRYFSLGLYGTYPMVFKEQVRMLFPLIGFKYAFDLSSlSnsrFNLFFLSMGLAADLFIPDLDGLYIRPLFMLS 

ISPFSNYKNFSGLTTEIMLGFNIGWRFFN 

t31 .aa 

IVSYPLSFGGGFSYQFTlSr^TDKTGATKFAPNFTRADHGINLJ^LFFDANYVLFEMSYKEAFVWHNGRYFSLGLYG 

YPMVFKEQVRMLFPLIGFKYAFDLSSl^FNLFFLSMGLAADLFIPDLDGLYIRPLFMLSISPFSNYKNFSGLTTEI 

MLGFNIGWRFFN 

£31. nt 

ATGAAGAAATATCTTTTTTTTATTTTATTTCTCATCTCTTCTAATAATTTAATTGTTTCTTATCCACTTTC 

GTGGAGGTTTTTCTTATCAATTTACTAATTATACTGATAAAACAGGCGCCACTAAATTTGCTCCAAATTOT 

AGCAGATCATGGGATTAATTTGAATTTATTTTTTGATGCAAATTATGTACTTTTTGAAATGTCTTACAAAGAGG 

TTTGTTGTTACTCACAATGGGAGATATTTCTCGCTTGGGCTTTATGGAACATATCCAATGGTTTTCAAAGAGCAGG 

TTAGAATGCTTTTCCCATTAATTGGGTTTAAATATGCTTTTGATTTAAGCTCTAATAACTTCAATCTCTTTTTOT 

AAGCATGGGGCTTGCTGCTGATCTTTTTATTCCCGATCTTGATGGTTTATATATTAGGCCTTTGTTTATGCTTTCT 

ATTTCTCCATTTTCTAATTATAAAAATTTTTCTGGGTTAACAACTGAGATTATGCTTGGATTTAATAT^ 

GATTTTTCAATTAG 
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t31 .nt 

ATTGTTTCTTATCCACTTTCTTTTGGTGGAGGTTTTTCTTATCAATTTACTAATTATACTGATAAAACAGGCGCCA 
CTAAATTTGCTCCAAATTTTACCAGAGCAGATCATGGGATTAATTTGAATTTATTTTTTGATGCAAATTATGTACT 
TTTTGAAATGTCTTACAAAGAGGCTTTTGTTGTTACTCACAATGGGAGATATTTCTCGCTTGGGCTTTATGGAACA 
TATCCAATGGTTTTCAAAGAGCAGGTTAGAATGCTTTTCCCATTAATTGGGTTTAAATATGCTTTTGATTTAAGCT 
CTAATAACTTCAATCTCTTTTTTTTAAGCATGGGGCTTGCTGCTGATCTTTTTATTCCCGATCTTGATGGTTTATA 
TATTAGGCCTTTGTTTATGCTTTCTATTTCTCCATTTTCTAATTATAAAAATTTTTCTGGGTTAACAACTGAGATT 
ATGCTTGGATTTAATATCGGTTGGAGATTTTTCAATTAG 



f939.aa 



MKQKYENYFKKRLILNLLIFLLLACSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLFKKENGK 

lEKIDLSNSYEFINDIVNISGKTYLLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFLKSVGRDGVKEAYILAIDK 

NNREKIFDLQGSDKTPPQATENDKFYQISNEENLITGNSLKIWQMNISINTYTNIDYQQAKEIMPIIK^ 

WTGGYNNLDTKFKVYSNTNNYTTPIFIQDEVGEFSSYFAREFNDAILIGSNNGFAEFTKNKEGIFALFIAPSKSVE 

PGAYNGSQLSKTGLNDIIPVSNNTIYILTQGKGLWKLENRKLTKE 

£939. aa 

CSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLFKKENGKIEKIDLSNSYEFINDIVNISGKTY 

LIAQl^EEELEVCELNGKDWTLKFKKPLKAYKFLKSVGRDGVKEAYILAIDKNNREKIFDLQGSDKTPPQATENDK 

FYQISNEENLITGNSLKIWQMNIOTTYTNIDYQQAKEIMPIIKTSIRGSSEVLVMTGGYNNLDTKFKWSN 

PIFIQDEVGEFSSYFAREFNDAILIGSNNGFAEFTKNKEGIFALRAPSKSVEPGAYNGSQLSKTGLNDIIPVSNNT 

lYILTQGKGLWKLENR 

KLTKE 

£939. nt 

ATGAAACAAAAATACGAAAACTATTTTAAAAAAAGATTAATTTTAAACCTATTAATATTTTTACTACTAGCATGCT 
CAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAATACAATATTTTGGGCAGTTCAAG 
TCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCCATGCATTTATTTAAAAAAGAAAACGGCAAG 
ATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTTATAAACGACATTGTAAATATATCTGGAAAAACCTATCTTT 
TAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATTGGACATTAAAATTTAAAAAACC 
GCTAAAAGCATATAAATTCTTAAAATCCGTAGGAAGAGATGGCGTAAAAGAAGCATATATTTTAGCTATAGATAAA 
AATAATCGTGAGAAAATTTTTGATCTACAAGGATCTGACAAAACACCACCACAAGCTACTGAAAATGACAAATTTT 
ATCAAATATCAAATGAAGAAAACTTAATTACAGGAAATTCACTCAAAATATGGCAAATGAATAACAATACATACAC 
AAACATAGACTATCAACAGGCCAAAGAAATAATGCCTATCATTAAAACAAGCATTAGGGGCTCTTCTGAAGTTTTA 
GTAATGACTGGTGGTTACAATAATTTAGATACAAAATTTAAAGTTTACTCAAATACAAATAATTACACAACGCCAA 
TATTTATTCAAGACGAAGTAGGCGAATTTAGCAGCTACTTTGCAAGAGAATTTAATGATGCGATATTAATCGGAAG 
TAATAATGGATTTGCAGAATTTACAAAAAATAAAGAAGGAATTTTTGCCCTACGGGCACCCTCAAAATCTGTAGAA 
CCTGGAGCTTATAACGGATCTCAGCTAAGCAAAACAGGCCTTAATGATATTATTCCTGTATCAAACAACACGATTT 
ACATATTAACTCAGGGCAAGGGTTTGTGGAAATTGGAAAACAGAAAATTAACTAAAGAATAA 

t939.nt 

TGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAATACAATATTTTG^ 

CAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCCATGCATTTATTTAAAAAAGAAAACGG 

CAAGATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTTATAAACGACATTGTAAATATATC 

CTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATTGGACATTAAAATTTAAA^ 
AACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGGAAGAGATGGCGTAAAAGAAGCATATATTTTAGCTATAGA 
TAAAAATAATCGTGAGAAAATTTTTGATCTACAAGGATCTGACAAAACACCACCACAAGCTACTGAAAATC 
TTTTATCAAATATCAAATGAAGAAAACTTAATTACAGGAAATTCACTCAAAATATGGCAAATC 
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ACACAAACATAGACTATCAACAGGCCAAAGAAATAATGCCTATCATTAAAACAAGCATTAGGGGCTCTTCTGAAGT 
TTTAGTAATGACTGGTGGTTACAATAATTTAGATACAAAATTTAAAGTTTACTCAAATACAAATAATTACACAACG 
CCAATATTTATTCAAGACGAAGTAGGCGAATTTAGCAGCTACTTTGCAAGAGAATTTAATGATGCGATATTAATCG 
GAAGTAATAATGGATTTGCAGAATTTACAAAAAATAAAGAAGGAATTTTTGCCCTACGGGCACCCTCAAAATCTGT 
AGAACCTGGAGCTTATAACGGATCTCAGCTAAGCAAAACAGGCCTTAATGATATTATTCCTGTATCAAACAACACG 
ATTTACATATTAACTCAGGGCAAGGGTTTGTGGAAATTGGAAAACAGAAAATTAACTAAAGAATAA 

£739. aa 

MQSGLKIKLILFFCCFACSCDIl^PEIKELDYKINYYFTENRLDYSMSFDFAIKVINSKDVFKLSIENKNTNEFIQ 

VINNNYSSFFIDSSLGKDILYCKDLRFNFFDKTFEDFTSCVRLFDKGMRVYNRELVISLGMSKYDLDDVHJ^^ 

KDMEMLNKLSNSKVFFVKTYKDKLHPVSSWRIDSIDILEIDKAFDNYISFYYVEKNSNLFFKVG 

t739.aa 

CCFACSCDINYPEIKELDYKINYYFTENRLDYSMSFDFAIKVINSKDVFKLSIENKNTNEFIQVINI^nsr^ 

SLGKDILYGKDLRFNFFDKTFEDFTSCWLFDKGmVYNRELVISLGMSKYDLDDVHlSrYVY 

VFFVKTYKDKLHPVSSWRIDSIDILEIDKAFDNYISFYYVEKNSNLFFKVG 

f739.nt 

ATGCAGAGCGGATTAAAAATTAAATTAATATTGTTTTTTTGTTGTTTTGCTTGTTCTTGCGACATAAATTATCCGG 
AGATAAAAGAGCTTGATTATAAGATAAATTATTATTTTACTGAAAATCGCTTAGATTACTCTATGAGTTTTGATTT 
TGCAATTAAAGTTATAAATTCAAAAGATGTTTTTAAATTATCAATAGAGAATAAGAACACTAATGAGTTTATTCAA 
GTGATTAATAATAATTATAGCTCTTTTTTTATTGATTCTAGCCTTGGAAAGGATATTCTATATTGTAAGGATTTGA 
GGTTTAATTTTTTTGATAAAACTTTTGAAGATTTTACCTCATGTGTTCGTCTTTTTGATAAGGGCATGAGAGTATA 
CAATAGAGAGCTTGTTATTTCTTTGGGTATGTCAAAATATGATTTAGATGATGTTCACAATTATGTATATAAGTCT 
AAAGATATGGAAATGTTAAACAAGTTAAGCAATTCCAAAGTATTTTTTGTTAAAACTTATAAAGACAAACTACATC 
CGGTCTCTTCAGTTGTTAGAATTGATTCAATAGATATTCTAGAGATTGATAAAGCATTTGATAATTACATAAGTTT 
TTATTATGTCGAAAAAAATTCAAATCTTTTTTTTAAAGTTGGCTGA 

t739 .nt 

TGTTGTTTTGCTTGTTCTTGCGACATAAATTATCCGGAGATAAAAGAGCTTGATTATAAGATAAATTATTATTTTA 
CTGAAAATCGCTTAGATTACTCTATGAGTTTTGATTTTGCAATTAAAGTTATAAATTCAAAAGATGTTTTTAAATT 
ATCAATAGAGAATAAGAACACTAATGAGTTTATTCAAGTGATTAATAATAATTATAGCTCTTTTTTTATTGATTCT 
AGCCTTGGAAAGGATATTCTATATTGTAAGGATTTGAGGTTTAATTTTTTTGATAAAACTTTTGAAGATTTTACCT 
CATGTGTTCGTCTTTTTGATAAGGGCATGAGAGTATACAATAGAGAGCTTGTTATTTCTTTGGGTATGTCAAAATA 
TGATTTAGATGATGTTCACAATTATGTATATAAGTCTAAAGATATGGAAATGTTAAACAAGTTAAGCAATTCCAAA 
GTATTTTTTGTTAAAACTTATAAAGACAAACTACATCCGGTCTCTTCAGTTGTTAGAATTGATTCAATAGATATTC 
TAGAGATTGATAAAGCATTTGATAATTACATAAGTTTTTATTATGTCGAAAAAAATTCAAATCTTTTTTTTAAAGT 
TGGCTGA 

f742.aa 

MNKKHTNFSVLLLLIFLLILSFGGFGYYIYQSKLNDKNREIMLNEVKNSVIDRNYKKAYSVAKLLQDKYPQNEDIA 

MLTNTLAEIANSSPFESKDLQRDSANQILDKIKGQDNTKTNVNENFDIAFNNRYIKDSTITENYSDRNDDVGIEDE 

DISEFKKSKIPEKIKPNTNPKEEDQIIQSPNPKLSVNDQKNLFNLEKLKKNLSGKSNSENILNDSQKIENDKQNTN 

LSKEKNSENILKTPDNSKYSNNNNTTSLKKISSNSQKESELSPPSQTIIGKIYRPYSYLIKKELYEILDDINTGRV 

TIX3KNRLKELIKKGLSNKFQKVNELIENSKNKEASNLLLTLIKKDIEPNLINIPKDPYKKEIFQLDKEDKKPQYLE 

DLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQ 

AIKDLJ^EFLKl^PNDAQASKTl^QANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLK^ 

AQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDU^EFXKNNPNDAQASKTi^QANKIQHLEDLKSKVHSI^ 

DLENTKSRQQAIKDLNEFXKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDI^ 

PNDAQASKTI^QANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDIJ^EFXKNNPNDAQASKTI^ 

AEl^YEKIIKLTNTQEDHYKI^IIRFKLKKYEHSIESFDQTIKLDPKHKKALHNKGIAIiMMLl^ 
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QIDKNYGTAYYQKGIAEEKNGDMQQAFASFKNAYNLDKNPNYALKAGIVSNNLGNFKQSEEYLNFFNANAKK 
AIYNLSIAKFENNKLEESLETINKAIDLJSrPEKSEYLYLKASINLKKENYQNAISLYSLVIEKNPENTSAYI 
YEKSGNKSQAISTLEKIINKNNKLALNNLGILYKKEKNYQKAIEIFEKAIINSDIE^ 
LLREYTKLKPNNPEALHALGIIEYNENNNDQTLRELIKKFPNYKKNENIKKIIGI 

t742 .aa 

KLNDKNREIMLNEVKNSVIDRNYKKAYSVAKLLQDKYPQNEDIAMLTNTLAEIANSSPFESKDL^ 
KGQD 

NTKTNVNENFDIAFNNRYIKDSTITENYSDRNDDVGIEDEDISEFKKSKIPEKIKPNTNPKEEDQIIQSPNPKLSV 

NDQKNLFNLEKLKKNLSGKSNSENILNDSQKIENDKQNTNLSKEKNSENILKTPDNSKYSlSnsnsnsrTTS 

KESELSPPSQTIIGKIYRPYSYLIKKELYEILDDINTGRVTLGKNRLKELIKKGLSNKFQKVNELIENSKNKEASN 

LLLTLIKKDIEPNLINIPKDPYKKEIFQLDKEDKKPQYLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPN 

DAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKS 

KVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKD 

IJSTEFXKNNPNDAQASKTIJ^QANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFXKNNPNDAQ 

KIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLEN 

TKSRQQAIKDLNEFXKNNPNDAQASKTLAQAYENNGDLLKAENAYEKIIKLTNTQEDHYKLGIIRFKLKKYEHSIE 

SFDQTIKLDPKHKKALHNKGIALmLNKNKKAIESFEKAIQIDKNYGTAYYQKGIAEEKNGDMQQAFASFKNAYNL 

DKNPNYALKAGIVSNNLGNFKQSEEYLNFFNANAKKPNEIAIYNLSIAKFENNKLEESLETINKAIDLNPEKSEYL 

YLKASINLKKENYQNAISLYSLVIEKNPENTSAYINLAKAYEKSGNKSQAISTLEKIINKNNKLALNNLGILYKKE 

KNYQKAIEIFEKAIINSDIEAKYNLATTLIEINDNTRAKDLLREYTKLKPITOPEALHALGIIEYNENI^ 

IKKFPNYKKNENI KKI IGI 

f742 .nt 

ATGAATAAAAAACATACAAATTTTTCGGTATTATTGCTTTTAATTTTCTTACTTATCTTATCATTTGGGGGCTTTG 
GTTACTATATATATCAAAGCAAATTAAATGACAAAAATCGAGAAATAATGCTAAACGAAGTTAAAAATAGCGTAAT 
AGATCGAAACTATAAAAAAGCATATTCTGTTGCAAAACTTCTGCAAGACAAATACCCCCAAAATGAAGACATTGCA 
ATGCTTACAAATACACTAGCAGAAATTGCCAACAGTAGTCCTTTTGAATCAAAAGACTTGCAAAGAGATTCTGCTA 
ATCAAATCTTAGACAAGATCAAAGGTCAAGACAATACAAAAACAAATGTAAACGAAAATTTTGATATAGCATTTAA 
TAATAGATACATTAAAGACAGCACAATAACAGAAAACTACTCTGACAGAAACGATGATGTTGGCATTGAAGATGAA 
GACATATCTGAATTTAAAAAAAGCAAAATCCCAGAAAAAATAAAACCAAATACAAACCCAAAAGAAGAAGACCAAA 
TAATACAATCTCCAAATCCGAAATTAAGTGTTAATGACCAAAAAAATTTATTTAATTTGGAAAAACTAAAATW^AAA 
TTTAAGTGGAAAATCAAATAGTGAAAATATTTTAAACGATTCTCAAAAAATAGAAAATGATAAGCAAAACACAAAT 
TTATCCAAAGAAAAAAATTCGGAGAATATTTTAAAAACTCCGGACAACAGTAAATATTCAAACAATAACAATACTA 
CATCTTTAAAAAAAATTTCTTCAAATTCCCAAAAAGAAAGTGAGCTTTCTCCACCCAGTCAAACAATAATAGGGAA 
AATTTATAGGCCATATAGCTACTTGATAAAAAAAGAGCTCTATGAAATATTAGACGATATTAATACCGGAAGAGTC 
ACACTTGGAAAAAACAGATTAAAAGAATTAATTAAAAAAGGTCTAAGCAACAAATTTCAAAAAGTAAATGAATTGA 
TTGAAAATTCAAAAAATAAAGAAGCTTCAAATTTACTATTAACCTTAATAAAAAAAGATATTGAACCAAATCTCAT 
TAATATACCAAAAGATCCTTACAAAAAAGAAATTTTTCAATTAGATAAAGAAGACAAAAAGCCTCAGTACCTAGAG 
GACCTTAAATCTAAAGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGG 
ATCTAAACGAATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACA 
ACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCC/^CAA 
GCCATTAAGGATCTAAACGAATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTA 
ATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATC 
ACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTTA 
GCTCAAGCTAATAAAATACAACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAA 
ACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTA 
AAACTTTAGCTCAAGCTAATAAAATACAACACCTGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGAT 
CTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAG 
GCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAAC 
CCATTGATCTTGAAAACACAAAAT 

CACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTT 

AGCTCAAGCTAATAAAATACAACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTC 

AACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCCAGGCCTC 
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TAAAACTTTAGCTCAAGCTTATGAAAACAATGGAGATTTGCTAAAAGCAGAAAATGCATACGAAAAAATTATCAAA 

CTCACAAATACCCAAGAAGATCACTATAAACTTGGAATCATTAGATTCAAGCTTAAAAAGTATGAACACTCAATAG 

AATCATTTGATCAAACAATAAAACTCGACCCAAAACATAAAAAAGCACTTCATAACAAAGGAATAGCTTTAATGAT 

GCTAAATAAAAACAAAAAAGCAATAGAATCTTTTGAGAAAGCAATACAAATTGATAAAAATTATGGCACCGCCTAC 

TACCAAAAAGGAATAGCAGAAGAAAAAAATGGCGATATGCAACAAGCATTTGCAAGCTTTAAAAATGCCTACAATC 

TCGACAAAAACCCCAATTATGCATTAAAAGCAGGAATAGTATCAAATAACTTGGGCAACTTCAAACAAAGTGA^ 

GTATTTAAATTTTTTTAATGCCAATGCAAAAAAACCTAACGAAATTGCTATTTACAACCTATCAATAGCAAAAT^ 

GAAAACAATAAACTTGAAGAATCTCTTGAAACAATAAACAAAGCCATAGATTTAAATCCAGAAAAAAGTGAATATT 

TATATTTAAAAGCATCTATAAATCTTAAAAAAGAAAATTACCAAAATGCTATATCACTTTACAGCTTAGTAATTGA 

AAAAAACCCTGAAAATACTTCAGCCTATATAAACCTGGCAAAAGCATATGAAAAATCAGGAAATAAAAGTCAAGCA 

ATCTCAACTCTTGAAAAGATAATAAACAAAAATAATAAATTAGCCTTAAACAATCTTGGGATACTTTACAAAAAAG 

AAAAAAATTATCAAAAAGCAATTGAAATTTTTGAAAAAGCAATAATCAATTCAGATATTGAAGCAAAATATAATCT 

TGCAACCACTCTAATTGAAATTAATGATAACACAAGAGCTAAAGACCTTCTAAGAGAATATACAAAATTAAAACCA 

AACAATCCAGAGGCCTTACATGCACTAGGAATAATAGAATATAATGAAAATT^CAATGATCAAACACTAAGAGAAC 

TTATAAAAAAATTTCCAAATTACAAAAAAAATGAAAATATTAAAAAAATAATAGGAATATAA 

t742 .nt 

AAATTAAATGACAAAAATCGAGAAATAATGCTAAACGAAGTTAAAAATAGCGTAATAGATCGAAACTATAAAAAAG 
CATATTCTGTTGCAAAACTTCTGCAAGACAAATACCCCCAAAATGAAGACATTGCAATGCTTACAAATACACTAGC 
AGAAATTGCCAACAGTAGTCCTTTTGAATCAAAAGACTTGCAAAGAGATTCTGCTAATCAAATCTTAGACAAGATC 
AAAGGTCAAGACAATACAAAAACAAATGTAAACGAAAATTTTGATATAGCATTTAATAATAGATACATTAAAGACA 
GCACAATAACAGAAAACTACTCTGACAGAAACGATGATGTTGGCATTGAAGATGAAGACATATCTGAATTTAAAAA 
AAGCAAAATCCCAGAAAAAATAAAACCAAATACAAACCCAAAAGAAGAAGACCAAATAATACAATCTCCAAATCCG 
AAATTAAGTGTTAATGACCAAAAAAATTTATTTAATTTGGAAAAACTAAAAAAAAATTTAAGTGGAAAATCAAATA 
GTGAAAATATTTTAAACGATTCTCAAAAAATAGAAAATGATAAGCAAAACACAAATTTATCCAAAGAAAAAAATTC 
GGAGAATATTTTAAAAACTCCGGACAACAGTAAATATTCAAACAATAACAATACTACATCTTTAAAAAAAATTTCT 
TCAAATTCCCAAAAAGAAAGTGAGCTTTCTCCACCCAGTCAAACAATAATAGGGAAAATTTATAGGCCATATAGCT 
ACTTGATAAAAAAAGAGCTCTATGAAATATTAGACGATATTAATACCGGAAGAGTCACACTTGGAAAAAACAGATT 
AAAAGAATTAATTAAAAAAGGTCTAAGCAACAAATTTCAAAAAGTAAATGAATTGATTGAAAATTCAAAAAATAAA 
GAAGCTTCAAATTTACTATTAACCTTAATAAAAAAAGATATTGAACCAAATCTCATTAATATACCAAAAGATCCTT 
ACAAAAAAGAAATTTTTCAATTAGATAAAGAAGACAAAAAGCCTCAGTACCTAGAGGACCTTAAATCTAAAGTTCA 
TTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTGAAA 
AACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAGGACCTTAAAT 
CTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGA 
ATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAG 
GACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGG 
ATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACA 
ACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAA 
GCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTAAAACTTTAGCTCAAGCTAAT 
AAAATACAACACCTGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACG 
CCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTAAAACTTTAGCTCA 
AGCTAATAAAATACTVACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACA 
AAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAA 
CTTTAGCTCAAGCTAATAAAATAC 

AACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACA 

AGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTTAGCTCAAGCTT 

ATGAAAACAATGGAGATTTGCTAAAAGCAGAAAATGCATACGAAAAAATTATCAAACTCACAAATACCCAAGAAGA 

TCACTATAAACTTGGAATCATTAGATTCAAGCTTAAAAAGTATGAACACTCAATAGAATCATTTGATCAAACAATA 

AAACTCGACCCAAAACATAAAAAAGCACTTCATAACAAAGGAATAGCTTTAATGATGCTAAATAAAAACAAAA^ 

CAATAGAATCTTTTGAGAAAGCAATACAAATTGATAAAAATTATGGCACCGCCTACTACCAAAAAGGAATAGCAGA 

AGAAAAAAATGGCGATATGCAACAAGCATTTGCAAGCTTTAAAAATGCCTACAATCTCGACAAAAACCCCAATTAT 

GCATTAAAAGCAGGAATAGTATCAAATAACTTGGGCAACTTCAAACAAAGTGAAGAGTATTTAAATTTTT^ 

CCAATGCAAAAAAACCTAACGAAATTGCTATTTACAACCTATCAATAGCAAAATTTGAAAACAATA^ 

ATCTCTTGAAACAATAAACAAAGCCATAGATTTAAATCCAGAAAAAAGTGAATATTTATATTTAA^ 

AATCTTAAAAAAGAAAATTACCAAAATGCTATATCACTTTACAGCTTAGTAATTGAAAAAAACCCTGAAA^ 
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CAGCCTATATAAACCTGGCAAAAGCATATGAAAAATCAGGAAATAAAAGTCAAGCAATCTCAACTCTTGAAAAGAT 
AATAAACAAAAATAATAAATTAGCCTTAAACAATCTTGGGATACTTTACAAAAAAGAAAAAAATTATCT^AAAAGCA 
ATTGAAATTTTTGAAAAAGCAATAATCAATTCAGATATTGAAGCAAAATATAATCTTGCAACCACTCTAATTGAAA 
TTAATGATAACACAAGAGCTAAAGACCTTCTAAGAGAATATACAAAATTAAAACCAAACAATCCAGAGGCCTTACA 
TGCACTAGGAATAATAGAATATAATGAAAATT^CAATGATCAAACACTAAGAGAACTATAAAAAAATTTCCAAATT 
ACAAAAAAAATGAAAATATTAAAAAAATAATAGGAATATAA 

£743. aa 

MRIYLFIJJKNYKIFILFLILIIJ^SKIAYSQRLIRIGKEEMKNKNYIQAIETLSDAIKKYPKVQL^ 
NQLTEAEGALLDGIAVGGEIDYILYYELGNIMFNRGEGYYPLAIKYYSNSIKSRPNYDSALLNRANAYVQQGKITS 
KEKEYQKAWDSYTMAIHDYSQFITLRSKTEKKDSILLIISYLRNEKINLEQLDKSLKGRTEHIVYAKEDKNQILKD 
SFKDNLETNSLIELEKLNWQEELYIDE 

t743 .aa 

YSQRLIRIGKEEMKNKNYIQAIETLSDAIKKYPKVQLGYYFLSIAYRENNQLTEAEGALLDGIAVGGEIDYILYYE 
LGNIMFNRGEGYYPLAIKYYSNSIKSRPNYDSALLNRANAYVQQGKITSKEKEYQKAWDSYTMAIHDYSQFITLRS 
KTEKKDSILLIISYLRNEKINLEQLDKSLKGRTEHIVYAKEDKJSTQILKDSFKDNLETNSLIELEKLNWQEELYIDE 

£743 .nt 

ATGAGGATTTATTTATTTTTAAATAAAAATTACAAGATTTTTATTTTATTTTTAATTTTAATATTAAATTCAAAAT 
TGGC 

ATATTCTCAAAGGCTAATTAGAATTGGCAAAGAAGAGATGAAAAACAAAAATTACATTCAAGCAATCGAAACACTA 
AGTGATGCTATTAAAAAATATCCAAAAGTACAACTCGGCTATTACTTTTTATCAATAGCATACAGAGAAAATAATC 
AACTAACAGAAGCAGAAGGAGCATTGCTCGATGGAATTGCAGTAGGGGGTGAAATCGACTACATACTATATTATGA 
ATTAGGCAACATAATGTTTAACAGAGGGGAAGGTTACTATCCTTTAGCAATAAAATATTATTCTAATTCTATTAAA 
AGTAGACCTAATTATGACAGTGCGCTACTAAACAGAGCTAATGCCTATGTTCAACAGGGCAAAATAACTTCTAAAG 
AAAAAGAATACCAAAAAGCTTGGGACTCTTATACTATGGCTATCCACGACTACTCTCAATTTATTACCCTTAGATC 
AAAAACAGAAAAAAAAGACAGCATTTTGCTTATAATAAGCTATTTAAGAAATGAAAAAATTAATCTTGAACAACTT 
GACAAAAGTTTGAAGGGGCGAACCGAGCATATTGTATACGCAAAAGAAGATAAAAATCAAATACTTAAAGATAGTT 
TTAAAGACAACCTAGAAACAAATTCTTTAATTGAGCTAGAAAAACTTAATTGGCAAGAGGAGTTATACATAGATGA 
ATAA 

t743 .nt 

TATTCTCAAAGGCTAATTAGAATTGGCAAAGAAGAGATGAAAAACAAAAATTACATTCAAGCAATCGAAACACTAA 

GTGATGCTATTAAT^AAATATCCAAAAGTACAACTCGGCTATTACTTTTTATCAATAGCATACAGAGAAAATAATCA 

ACTAACAGAAGCAGAAGGAGCATTGCTCGATGGAATTGCAGTAGGGGGTGAAATCGACTACATACTATATTATGAA 

TTAGGCAACATAATGTTTAACAGAGGGGAAGGTTACTATCCTTTAGCAATAAAATATTATTCTAATTCTATTAAAA 

GTAGACCTAATTATGACAGTGCGCTACTAAACAGAGCTAATGCCTATGTTCAACAGGGCAAAATAACTTCTAAAGA 

AAAAGAATACCAAAAAGCTTGGGACTCTTATACTATGGCTATCCACGACTACTCTCAATTTATTACCCTTAGATCA 

AAAACAGAAAAAAAAGACAGCATTTTCCTTATAATAAGCTATTTAAGAAATGAAAAAATTAATCTTGAACAA 

ACAAAAGTTTGAAGGGGCGAACCGAGCATATTGTATACGCAAAAGAAGATAAAAATCAAATACTTAAAGATAGTTT 

TAAAGACAACCTAGAAACAAATTCTTTAATTGAGCTAGAAAAACTTAATTGGCAAGAGGAGTTATACATAGATGAA 

TAA 

£748. aa 

MKFIINLLLSTIKIITFTVIVCLTILSIFQPIYILKENEISITTRLGKIQRTENLAGLKYKIPLIENVQIFPKIIL 

RWTCEPQRIPTGGEEKQLIWIIXrTARWKIADINKFYTTIKTMSRAYVRIDAAIEPAVRGVIAKYPLLEIIRSSNDP 

IQRIiSNGILTPQETKINGIYKITKGRKIIEKEIIRIANNNTKDIGIEIVDVLIRKVTYDPSLIESVNNRMISE 

lAEEQRSIGIJVEKTEILGSIEKEKLKILSEAKATAAKIKAEGDREAAKIYSNAYGKNIEFYKFWQALESYKAV^ 

KRKIFSTDMDFFQYLHKRN 
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t748 .aa 

IFQPIYILKENEISITTRLGKIQRTENLAGLKYKIPLIEWQIFPKIILRWDGEPQRIPTGGEEKQLIWIDTTARW 
KIADINKFYTTIKTMSRAYVRIDAAIEPAVRGVIAKYPLLEIIRSSNDPIQRLSNGILTPQETKINGIYKITKGRK 
IIEKEIIRIAN^^OTKDIGIEIVDVLIRKVTYDPSLIESV^^S^RMISERQQIAEEQRSIGI^ 
LSEAKATAAKIKAEGDREAAKIYSNAYGKNIEFYKFWQALESYKAVLKDKRKIFSTDMDFFQYLHKRN 

f748 .nt 

ATGAAATTTATAATAAATCTTTTATTATCTACTATAAAGATTATAACCTTTACAGTAATAGTTTGCTTGACTATTT 
TGTCTATTTTCCAGCCAATTTATATTTTGAAAGAAAATGAAATTTCAATAACCACTCGACTTGGAAAAATTCAAAG 
AACTGAAAATTTAGCTGGACTTAAATATAAAATACCATTAATTGAAAATGTGCAAATATTTCCCAAAATCATTCTT 
AGATGGGATGGAGAACCTCAAAGAATCCCAACAGGAGGGGAAGAAAAGCAATTAATATGGATTGATACAACTGCTA 
GATGGAAAATTGCAGACATAAATAAATTTTACACAACAATAAAAACAATGAGTAGAGCTTACGTTAGAATTGATGC 
AGCAATTGAACCTGCTGTTAGGGGGGTTATTGCAAAATACCCTTTGCTTGAAATTATAAGAAGCTCAAACGATCCT 
ATTCAACGTTTGTCTAATGGAATACTCACCCCACAAGAAACAAAAATTAACGGTATTTATAAAATAACAAAAGGAC 
GAAAGATAATCGAAAAAGAAATAATTCGTATAGCAAACAACAATACCAAAGATATTGGAATTGAAATTGTAGACGT 
ACTAATAAGAAAAGTTACTTATGACCCAAGCCTTATTGAATCTGTAAACAACAGAATGATCTCAGAAAGACAACAA 
ATCGCAGAAGAACAAAGAAGCATAGGATTAGCTGAAAAAACAGAAATTCTTGGAAGCATAGAAAAAGAAAAACTGA 
AAATATTAAGTGAAGCAAAAGCCACTGCTGCAAAAATAAAAGCCGAAGGGGATAGAGAAGCCGCAAAAATTTATTC 
AAATGCATATGGCAAAAATATTGAATTTTACAAATTCTGGCAGGCATTAGAAAGCTATAAAGCAGTATTAAAAGAT 
AAAAGAAAAATTTTCTCAACAGACATGGATTTCTTTCAATATCTTCACAAAAGAAATTGA 

t748 -nt 

ATTTTCCAGCCAATTTATATTTTGAAAGAAAATGAAATTTCAATAACCACTCGACTTGGAAAAATTCAAAGAACTG 
AAAATTTAGCTGGACTTAAATATAAAATACCATTAATTGAAAATGTGCAAATATTTCCCAAAATCATTCTTAGATG 
GGATGGAGAACCTCAAAGAATCCCAACAGGAGGGGAAGAAAAGCAATTAATATGGATTGATACAACTGCTAGATGG 
AAAATTGCAGACATAAATAAATTTTACACAACAATAAAAACAATGAGTAGAGCTTACGTTAGAATTGATGCAGCAA 
TTGAACCTGCTGTTAGGGGGGTTATTGCAAAATACCCTTTGCTTGAAATTATAAGAAGCTCAAACGATCCTATTCA 
ACGTTTGTCTAATGGAATACTCACCCCACAAGAAACAAAAATTAACGGTATTTATAAAATAACAAAAGGACGAAAG 
ATAATCGAAAAAGAAATAATTCGTATAGCAAACAACAATACCAAAGATATTGGAATTGAAATTGTAGACGTACTAA 
TAAGAAAAGTTACTTATGACCCAAGCCTTATTGAATCTGTAAACAACAGAATGATCTCAGAAAGACAACAAATCGC 
AGAAGAACAAAGAAGCATAGGATTAGCTGAAAAAACAGAAATTCTTGGAAGCATAGAAAAAGAAAAACTGAAAATA 
TTAAGTGAAGCAAAAGCCACTGCTGCAAAAATAAAAGCCGAAGGGGATAGAGAAGCCGCAAAAATTTATTCAAATG 
CATATGGCAAAAATATTGAATTTTACAAATTCTGGCAGGCATTAGAAAGCTATAAAGCAGTATTAAAAGATAAAAG 
AAAAATTTTCTCAACAGACATGGATTTCTTTCAATATCTTCACAAAAGAAATTGA 

£764. aa 

MSGPKKLAIIALLVISIQGCKESSIIEKQFNYAIIFSDATEYFFEIQTTPFIKNEILFINDKNLEIIKDKLKTTKK 
ILLTHKSNNEILNNEILKEKIFYLSKIKFSLKKSIDFLLNEKSIDLQKTLLFRDKSLNNEDLEYLEKKGKEKNVNI 
TLINEKNISYIQTFITSQIKTIILFSLRDlSnsriILKKILNSPFSKNIKFVLIGNTRKDLKIIKLKYIITLKEPDLIK 
lAKDVEKDFQYEFNIYKQ 

f7 64.aa 

EKQFNYAIIFSDATEYFFEIQTTPFIKNEILFINDKNLEIIKDKLKTTKKILLTHKSNNEIIJSINEIL^ 

IKFSLKKSIDFLLNEKSIDLQKTLLFRDKSLNNEDLEYLEKKGKEKNVNITLINEKNISYIQTFITSQIKTIILFS 

LRDNNIILKKILNSPFSKNIKFVLIGNTRKDLKIIKLKYIITLKEPDLIKIAKDVEKDFQYEFNIYKQ 

f764.nt 

ATGTCTGGCCCTAAAAAACTTGCTATAATAGCGCTCTTAGTAATTTCAATACAAGGATGCAAAGAATCTTCTA^ 
TTGAAAAACAATTTAATTATGCAATAATTTTTTCAGATGCAACTGAATATTTTTTT^ 

CATAAAAAACGAT^TACTATTTATAAATGACAAAAATTTAGAAATTATAAAAGACAAGCTTAAAACAACAAAAAAA 
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ATACTATTAACTCATAAATCAAATAATGAAATTCTAAATAACGAAATTCTAAAAGAGAAAATTTTTTATCTATCAA 
AAATAAAATTTTCTCTAAAAAAATCTATTGACTTTCTGCTTAACGAAAAATCAATAGATTTGCAAAAAACATTACT 
ATTTAGAGACAAATCTCTAAATAACGAAGACCTTGAATACTTGGAAAAAAAAGGCAAAGAAAAAAATGTCAATATT 
ACTCTAATAAACGAAAAAAACATATCCTATATTCAAACATTCATTACTTCTCAAATAAAAACAATAATATTATTCT 
CTTTAAGAGATAATAATATTATTTTAAAAAAGATACTAAATTCGCCTTTTTCTAAAAATATAAAATTTGTATTAAT 
TGGCAATACAAGAAAAGACTTAAAAATTATTAAGCTAAAATATATAATCACCCTTAAAGAGCCTGATTTGATAAAA 
ATAGCAAAAGATGTTGAAAAAGATTTTCAATATGAATTTAACATTTATAAACAATAA 

t764,nt 

GAAAAACAATTTAATTATGCAATAATTTTTTCAGATGCAACTGAATATTTTTTTGAAATTCAAACAACTCCATTCA 
TAAAAAACGAAATACTATTTATAAATGACAAAAATTTAGAAATTATAAAAGACAAGCTTAAAACAACAAAAAAAAT 
ACTATTAACTCATAAATCAAATAATGAAATTCTAAATAACGAAATTCTAAAAGAGAAAATTTTTTATCTATCAAAA 
ATAAAATTTTCTCTAAAAAAATCTATTGACTTTCTGCTTAACGAAAAATCAATAGATTTGCAAAAAACATTACTAT 
TTAGAGACAAATCTCTAAATAACGAAGACCTTGAATACTTGGAAAAAAAAGGCAAAGAAAAAAATGTCAATATTAC 
TCTAATAAACGAAAAAAACATATCCTATATTCAAACATTCATTACTTCTCAAATAAAAACAATAATATTATTCTCT 
TTAAGAGATAATAATATTATTTTAAAAAAGATACTAAATTCGCCTTTTTCTAAAAATATAAAATTTGTATTAATTG 
GCAATACAAGAAAAGACTTAAAAATTATTAAGCTAAAATATATAATCACCCTTAAAGAGCCTGATTTGATAAAAAT 
AGCAAAAGATGTTGAAAAAGATTTTCAATATGAATTTAACATTTATAAACAATAA 

£770. aa 

MINFSKSFFYPLPIGKIFVLSGDMGSGKTSFLKGLALNLGISYFTSPTYNIVNVYDFINFKFYHIDLYRVSSLEEF 
ELVGGLEILMDLDSIIAIEWPQIALSIVPKDRLFSLTFKIVGSGRWELNG 

t770.aa 

KTSFLKG3JALNLGISYFTSPTYNIVIWYDFINFKFYHIDLYRVSSLEEFELVGGLEILMDLDSIIAIEWPQIALSI 
VPKDRLFSLTFKIVGSGRWELNG 

f770.nt 

ATGATAAATTTTTCCAAATCTTTTTTTTATCCTTTGCCAATTGGTAAAATATTTGTTTTAAGTGGTGACATGGGAT 
CTGGAAAAACTAGTTTTTTAAAGGGACTTGCCCTTAACCTTGGAATTTCTTATTTTACAAGTCCAACTTATAACAT 
TGTTAATGTTTATGATTTTATAAATTTTAAATTTTATCATATTGATTTATATCGGGTGTCTTCTTTGGAAGAATTT 
GAGCTTGTTGGGGGATTGGAAATACTTATGGATCTTGACTCGATTATTGCTATTGAATGGCCACAAATTGCTTTGA 
GCATTGTTCCAAAAGATAGATTATTTTCTTTAACTTTTAAAATAGTAGGTTCAGGCAGGGTTGTAGAACTTAATGG 
TTAA 

t770.nt 

AAAACTAGTTTTTTAAAGGGACTTGCCCTTAACCTTGGAATTTCTTATTTTACAAGTCCAACTTATAACATTGTTA 

ATGTTTATGATTTTATAAATTTTAAATTTTATCATATTGATTTATATCGGGTGTCTTCTTTGGAAGAATTTGA 

TGTTGGGGGATTGGAAATACTTATGGATCTTGACTCGATTATTGCTATTGAATGGCCACA^ 

GTTCCAAAAGATAGATTATTTTCTTTAACTTTTAAAATAGTAGGTTCAGGCAGGGTTGTAGAACTTAATGGTTAA 
£790. aa 

MNTKATTPLLLLFLIQSrAFSSEIFEFKYIKGSKFRLEGTDNQKIYFNGHYNSSSNTNIQISSEIKDIKENFASIK 

AFFRILKRENINEPYLLNEEFEEIFSVNKQGEYTIGANQKRPSVRGIPRFPKTPIKINEKWSYLAEEYIEASKIDK 

SIKDFVVKFNVNYEYKGKEEHNGKHYHIILSNYESQYNVKNISFYQKVDQKIYFDNEIGNT^ 

NQHFKMIGNSLGRIVSIELP^roNLIETEVENYIREKKIKAIEVEKNNKGINLSFDIEFYPNSFQILQK^ 

AKLLEKFKKNNILIEGHTEQFGLEEEMHELSEKRARAIGNYLIKMKVKDKDQILFKGWGSQKPK^ 

RVEITILNN 



t790.aa 



wo 98/59071 



PCT/US98/12718 



130 

TABLE 1. Nucleotide and Amino Acid Sequences 

SEIFEFKriiriKGSKFRLEGTDNQKiyFNGHYNSSSNTNIQISSEIKDIKENFASIKAFFRILKRENINEPYLLNEEF 

EEIFSVNKQGEYTIGANQKRPSWGIPRFPKTPIKINEKWSYIAEEYIEASKIDKSIKDFWKFNVNYEYK^ 

NGKHYHIILSl^ESQYNVKNISFYQKVDQKIYFDNEIGNTYKYSDKYIFEINQNNNQHFKMIGNSLGRIV^ 

DNLIETEVEl^IREKKIKAIEVEKNNKGINLSFDIEFYPNSFQILQKEYKKIDLIAKLLEKFKKNNILIEGHTEQF 

GLEEEMHELSEKRARAIGNYLIKMKVKDKDQILFKGWGSQKPKYPKSSPLKAKNRRVEITILNN 

f790.nt 

ATGAATACCAAGGCGACTACACCATTGTTGTTATTATTTTTAATTCAAAGCTTAGCTTTTTCTTCTGAAATCTTTG 
AATTTAAATACATTAAAGGTTCAAAGTTTAGATTAGAAGGCACAGATAATCAAAAAATATATTTCAATGGCCATTA 
TAATTCAAGCTCTAATACCAATATTCAAATTTCAAGTGAAATAAAAGACATAAAAGAAAACTTTGCAAGCATTAAA 
GCTTTTTTTAGAATCTTAAAAAGAGAAAATATTAATGAACCTTACCTATTAAATGAAGAGTTTGAAGAAATCTTCA 
GCGTAAATAAGCAAGGAGAATATACAATAGGAGCAAATCAAAAAAGACCTTCTGTTAGAGGTATTCCAAGATTCCC 
AAAAACACCAATCAAAATAAATGAAAAATGGTCATATCTTGCAGAAGAATATATAGAAGCGTCAAAAATAGACAAA 
AGTATAAAAGATTTCGTTGTAAAATTTAATGTTAACTACGAATATAAAGGCAAAGAAGAGCACAATGGCAAGCATT 
ACCACATAATTCTTTCGAATTATGAATCACAATACAATGTAAAAAACATCTCTTTCTATCAAAAAGTAGACCAAAA 
AATTTATTTTGATAATGAAATTGGCAATACATATAAATACAGCGATAAATATATATTTGAAATAAATCAGAACAAC 
AACCAACATTTTAAAATGATTGGAAACTCTCTTGGCAGAATAGTTTCAATTGAGCTTCCAAATGATAATCTTATTG 
AAACTGAGGTTGAAAATTACATCCGAGAAAAAAAAATAAAAGCTATTGAAGTTGAAAAAAACAATAAAGGTATTAA 
TTTAAGCTTTGACATTGAATTTTATCCTAACTCATTTCAAATACTACAAAAAGAATATAAAAAAATTGACCTTATA 
GCTAAACTTCTTGAAAAATTTAAAAAAAATAACATACTAATAGAAGGACATACTGAGCAATTTGGATTGGAAGAAG 
AGATGCACGAGCTATCTGAAAAAAGAGCTCGTGCAATTGGAAATTATTTAATAAAAATGAAAGTAAAAGACAAAGA 
CCAAATACTATTTAAAGGATGGGGATCTCAAAAACCAAAATATCCTAAGTCCTCCCCATTAAAGGCTAAAAATAGG 
CGAGTAGAAATTACAATATTAAATAACTAA 

t790.nt 

TCTGAAATCTTTGAATTTAAATACATTAAAGGTTCAAAGTTTAGATTAGAAGGCACAGATAATCAAAAAATATATT 

TCAATGGCCATTATAATTCAAGCTCTAATACCAATATTCAAATTTCAAGTGAAATAAAAGACATAAAAGAAAACTT 

TGCAAGCATTAAAGCTTTTTTTAGAATCTTAAAAAGAGAAAATATTAATGAACCTTACCTATTAAATGAAGAGTTT 

GAAGAAATCTTCAGCGTAAATAAGCAAGGAGAATATACAATAGGAGCAAATCAAAAAAGACCTTCTGTTAGAGGTA 

TTCCAAGATTCCCAAAAACACCAATCAAAATAAATGAAAAATGGTCATATCTTGCAGAAGAATATATAGAAGCGTC 

AAAAATAGACAAAAGTATAAAAGATTTCGTTGTAAAATTTAATGTTAACTACGAATATAAAGGCAAAGAAGAGCAC 

AATGGCAAGCATTACCACATAATTCTTTCGAATTATGAATCACAATACAATGTAAAAAACATCTCTTTCTATCAAA 

AAGTAGACCAAAAAATTTATTTTGATAATGAAATTGGCAATACATATAAATACAGCGATAAATATATATTTGAAAT 

AAATCAGAACAACAACCAACATTTTAAAATGATTGGAAACTCTCTTGGCAGAATAGTTTCAATTGAGCTTCCAAAT 

GATAATCTTATTGAAACTGAGGTTGAAAATTACATCCGAGAAAAAAAAATAAAAGCTATTGAAGTTGAAAAAAACA 

ATAAAGGTATTAATTTAAGCTTTGACATTGAATTTTATCCTAACTCATTTCAAATACTACAAAAAGAATATAAAAA 

AATTGACCTTATAGCTAAACTTCTTGAAAAATTTAAAAAAAATAACATACTAATAGAAGGACATACTGAGCAATTT 

GGATTGGAAGAAGAGATGCACGAGCTATCTGAAAAAAGAGCTCGTGCAATTGGAAATTATTTAATAAAAATGA^ 

TAAAAGACAAAGACCAAATACTATTTAAAGGATGGGGATCTCAAAAACCAAAATATCCTAAGTCCTCCCCATTAAA 

GGCTAAAAATAGGCGAGTAGAAATTACAATATTAAATAACTAA 

£792 .aa 

MKIFIYWWIFFFSVPKVFSIYSLTDEEFFKKYSLFFVHKGFLSKNWGKITWQWGINSRWVYPFYKLVPSRIT 

SIYEDVYSSSSFLTTSNNLYVSYDYSKNFRKLVGIDKFNSGAYITSSAFSQGDYKRIAIGTAIHGIYLSVNGAISF 

KNIJORLIPQIYLGAGYYDIISAIEFSKEETNNLYFSSGVYGDIFLISQKSGFIKKISFPFKKQIIRILDLSSKNVE 

KILVRTYDNHFYSYINGQWVFIGKLSLQDQDFFEKSQRMQLAKNKGSIYLTAYTLRNKKAVDERFKFIKDSGMNAV 

VIDFKDDNGNLTYSSKLSLPNKLRSVKNFIDVPYILKKAKELGIYVIARCWFKDSKLYYYDNFKH^ 

WAHLIKKATDSSGLVKYVQVEJiWVDIFSPATWEYNISIAKEIQSFGVDEIQFDYIRFPSIXS 

VDALESFLIMAREQLYVPISVDIYGYNGWFPTNSIGQNISMLSDYVDVISPMFYPSHYTDDFLPSNFYYTKi^ 

YKEGSDRALAFSLIX;WIRPYVQAFLLGKERLVDDEIYLEYLKFQLKGIKESFGSGFSLV«aASNVYYMI 

LDSF 
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t792 .aa 

lYSLTDEEFFKKYSLFFVHKGFLSKNVNGKITKVQVNGINSRWVYPFYKLVPSRITSIYEDVYSSSSFLTTSNNLY 

VSYDYSKNFRKLVGIDKFNSGAYITSSAFSQGDYKRIAIGTAIHGIYLSVNGAISFKNLNRLIPQIYLGAGYYDII 

SAIEFSKEETNNLYFSSGWGDIFLISQKSGFIKKISFPFKKQIIRILDLSSKNVEKILVRTYDNHFYSYINGQW 

FIGKLSLQDQDFFEKSQRMQUUCNKGSIYLTAYTLRNKKAVDERFKFIKDSGMNAWIDFKDDNGNLTYSS 

NKLRSVKNFIDVPYILKKAKELGIYVIARCVWKIDSKLYYYDNFKHALWNKKTNKPWAHLIKK^ 

HWVDIFSPATWEYNISIAKEIQSFGVDEIQFDYIRFPSDGPVSIAISRl^KYEMQPVDALESFLIM^ 

VDIYGYNGWFPTNSIGQNISMLSDYVDVISPMFYPSHYTDDFLPSNFYTTKRAYRIYKEGSDRAIAFSLDGVVIRP 

YVQAFLLGKERLVDDEIYLEYLKFQLKGIKESFGSGFSLWNASNVYYMIKGSLKEYLDSF 

f792 .nt 

ATGAAAATTTTTATCTATTGGGTAGTTATTTTCTTCTTTTCTGTTTTCAAGGTTTTTAGTATATATTCATTAACCG 
ATGAAGAATTTTTTAAAAAATATAGTTTATTTTTTGTTCATAAAGGATTTTTAAGTAAAAATGTTAATGGGAAAAT 
AACCAAAGTTCAAGTCAATGGGATAAATTCTAGGTGGGTTTACCCTTTTTATAAGCTTGTTCCTAGTCGAATTACT 
TCTATTTATGAGGATGTTTATTCTTCAAGTTCATTTTTGACTACAAGTAACAATCTTTATGTTTCTTATGATTATT 
CAAAAAATTTTAGAAAATTAGTAGGAATTGATAAATTTAATAGCGGTGCATATATTACATCTAGTGCCTTTTCTCA 
AGGAGATTACAAGCGTATTGCTATTGGAACTGCGATTCATGGTATTTATCTTAGTGTTAATGGAGCTATTAGTTTT 
AAAAATTTAAATCGTTTGATTCCGCAGATTTATTTAGGTGCAGGATATTACGATATTATTAGTGCTATTGAATTTT 
CAAAAGAAGAGACAAATAATTTATATTTTTCCTCTGGAGTTTATGGAGATATTTTTTTAATTAGTCAGAAAAGTGG 
ATTTATTAAAAAAATATCTTTTCCTTTCAAAAAGCAAATAATACGTATTTTAGACTTATCTAGTAAGAATGTAGAA 
AAAATTTTAGTCAGAACATATGACAATCATTTTTATTCTTATATTAATGGGCAATGGGTATTTATTGGAAAATTAT 
CTTTGCAGGATCAGGATTTTTTTGAAAAATCACAAAGGATGCAGCTTGCTAAAAATAAAGGGTCTATTTATTTAAC 
AGCATATACATTGCGTAATAAGAAGGCAGTTGATGAAAGATTTAAATTTATTAAAGATTCAGGTATGAATGCTGTT 
GTAATTGATTTTAAAGATGATAATGGTAATTTGACTTATTCTAGCAAGCTTTCTTTGCCCAATAAGTTGAGATCTG 
TTAAAAACTTTATTGATGTTCCTTATATTCTTAAAAAAGCAAAAGAGCTTGGAATTTATGTTATTGCTAGATGTGT 
TGTATTTAAAGATTCAAAATTGTATTATTATGATAATTTTAAACACGCCCTTTGGAATAAAAAAACCAATAAACCT 
TGGGCTCATTTGATTAAAAAAGTTGATTCTAGTGGTCTTGTGAAATATGTACAAGTAGAGCATTGGGTAGATATTT 
TTTCTCCTGCTACTTGGGAATATAATATTTCTATCGCAAAAGAAATTCAATCTTTTGGAGTTGACGAGATACAATT 
TGATTATATTAGATTTCCATCAGATGGGCCTGTGTCTCTTGCAATCTCAAGAATGAATAAGTATGAGATGCAACCC 
GTTGATGCACTTGAATCTTTTTTGATTATGGCAAGAGAACAGCTTTATGTTCCTATTTCTGTTGATATTTATGGGT 
ACAATGGCTGGTTTCCTACTAATAGTATTGGGCAAAATATTTCAATGTTATCAGATTATGTTGACGTCATATCTCC 
TATGTTTTATCCTTCGCATTATACTGATGATTTTTTGCCAAGCAATTTTTATTACACAAAAAGAGCTTATAGGATT 
TATAAAGAGGGGAGTGATAGAGCACTTGCTTTTTCTTTAGATGGGGTTGTTATTAGGCCTTATGTTCAAGCTTTTT 
TATTAGGAAAAGAAAGATTGGTGGATGACGAGATTTATTTGGAGTATTTAAAGTTTCAGCTTAAAGGAATTAAAGA 
GTCATTTGGTAGTGGCTTTAGCCTTTGGAATGCATCTAATGTTTATTATATGATTAAAGGTAGTTTAAAAGAATAT 
TTAGATTCTTTTTA 

t792.nt 

ATATATTCATTAACCGATGAAGAATTTTTTAAAAAATATAGTTTATTTTTTGTTCATAAAGGATTTTTAAGTAAAA 

ATGTTAATGGGAAAATAACCAAAGTTCAAGTCAATGGGATAAATTCTAGGTGGGTTTACCCTTTTTATAAGCTTGT 

TCCTAGTCGAATTACTTCTATTTATGAGGATGTTTATTCTTCAAGTTCATTTTTGACTACAAGTAACAATCTTTAT 

GTTTCTTATGATTATTCAAAAAATTTTAGAAAATTAGTAGGAATTGATAAATTTAATAGCGGTGCATATATTACAT 

CTAGTGCCTTTTCTCAAGGAGATTACAAGCGTATTGCTATTGGAACTGCGATTCATGGTATTTATCTTAGTGTTAA 

TGGAGCTATTAGTTTTAAAAATTTAAATCGTTTGATTCCGCAGATTTATTTAGGTGCAGGATATTACGATATTATT 

AGTGCTATTGAATTTTCAAAAGAAGAGACAAATAATTTATATTTTTCCTCTGGAGTTTATGGAGATATTTTTTO 

TTAGTCAGAAAAGTGGATTTATTAAAAAAATATCTTTTCCTTTCAAAAAGCAAATAATACGTATTTTAGACTTATC 

TAGTAAGAATGTAGAAAAAATTTTAGTCAGAACATATGACAATCATTTTTATTCTTATATTAATGGGCAATGGGTA 

TTTATTGGAAAATTATCTTTGCAGGATCAGGATTTTTTTGAAAAATCACAAAGGATGCAGCTTC 

GGTCTATTTATTTAACAGCATATACATTGCGTAATAAGAAGGCAGTTGATGAAAGATTTAAATTTATTAAAGA 

AGGTATGAATGCTGTTGTAATTGATTTTAAAGATGATAATGGTAATTTGACTTATTCTAGCAAGCTTTCTTTG^ 

AATAAGTTGAGATCTGTTAAAAACTTTATTGATGTTCCTTATATTCTTAAAAAAGCAAAAGAGCTT^ 

TTATTGCTAGATGTGTTGTATTTAAAGATTCAAAATTGTATTATTATGATAATTTTAAACAC^ 

AAAAACCAATAAACCTTGGGCTCATTTGATTAAAAAAGTTGATTCTAGTGGTCTTGTGAAA 
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CATTGGGTAGATATTTTTTCTCCTGCTACTTGGGAATATAATATTTCTATCGCAAAAGAAATTCAATCTTTTG^ 

TTGACGAGATACAATTTGATTATATTAGATTTCCATCAGATGGGCCTGTGTCTCTTGCAATCTCAAGAATGAATAA 

GTATGAGATGCAACCCGTTGATGCACTTGAATCTTTTTTGATTATGGCAAGAGAACAGCTTTATGTTCCTATTTCT 

GTTGATATTTATGGGTACAATGGCTGGTTTCCTACTAATAGTATTGGGCAAAATATTTCAATGTTATCAGATTATG 

TTGACGTCATATCTCCTATGTTTTATCCTTCGCATTATACTGATGATTTTTTGCCAAGCAATTTTTATTACACAAA 

AAGAGCIT'ATAGGATTTATAAAGAGGGGAGTGATAGAGCACTTGCTTTTTCTTTAGATGGGGTTGTTATTAGGCCT 

TATGTTCAAGCTTTTTTATTAGGAAAAGAAAGATTGGTGGATGACGAGATTTATTTGGAGTATTTAAAGTTTCAGC 

TTAAAGGAATTAAAGAGTCATTTGGTAGTGGCTTTAGCCTTTGGAATGCATCTAATGTTTATTATATGATTAAAGG 

TAGTTTAAAAGAATATTTAGATTCTTTTTAA 

£797 .aa 

MSIKKFILTLIILSIAKJSrSFSENEINIFENENYIVKENIKTEIKKLKQSFLLASVDVAISQPYIEIA^ 

LEGISYSFINVFSKIGSSAIISFDLSNEASKKYKIIKLEFLSPDKGNFINQLSSLTSGKQQSKKELAKDAYSFGTL 

RTESLSKTIAEYYKDNNWYYILAAITVENNINKETEKYEIRINPKIYNDFQKKLRLHFKSN^ 

t797.aa 

KNSFSENEINIFENENYIVKENIKTEIKKLKQSFLLASVDVAISQPYIELADLNGEPIKELEGISYSFINVFSKIG 
SSAIISFDLSNEASKKYKIIKLEFLSPDKGNFINQLSSLTSGKQQSKKELAKDAYSFGTLRTESLSKTIAEYYKDN 
NWYYILAAITVENNINKETEKYEIRINPKIYNDFQKKLRLHFKSNQIKKFPIPIIE 

£797 .nt 

ATGAGCATTAAAAAATTTATTTTAACCTTGATAATTCTTTCTCTAGCTAAAAATAGCTTTTCTGAAAACGAAATTA 
ATATCTTCGAAAACGAAAATTATATTGTAAAAGAAAATATAAAAACAGAAATTAAAAAACTAAAACAAAGTTTTTT 
ACTTGCATCTGTTGATGTCGCCATTAGCCAACCCTACATAGAATTGGCAGATTTAAATGGAGAACCGATAAAAGAA 
CTTGAAGGGATTAGTTATTCATTTATAAATGTATTTTCAAAAATTGGATCTTCTGCTATTATTTCATTTGACCTAT 
CAAACGAAGCTTCCAAGAAATACAAAATCATAAAATTAGAATTTTTAAGTCCAGATAAAGGCAATTTTATTAACCA 
GCTAAGCAGCCTTACTAGTGGAAAACAGCAATCAAAAAAAGAGCTTGCAAAAGACGCTTACTCATTTGGTACATTA 
AGAACTGAATCTCTTTCAAAAACAATTGCAGAATATTACAAAGATAACAACTGGTATTATATTTTAGCAGCAATAA 
CAGTAGAAAATAATATAAATAAAGAAACTGAAAAATACGAAATTAGAATTAACCCTAAAATATATAATGATTTTCA 
AAAAAAATTGAGATTACATTTTAAAAGCAACCAAATAAAAAAATTTCCAATACCCATTATAGAATAA 

t797.nt 

aaaaatagcttttctgaaaacgaaattaatatcttcgaaaacgaa^^attatattgtaaaagaaaatat;^^ 

aaattaaaaaactaaaacaaagttttttacttgcatctgttgatgtcgccattagccaaccctacatagaattggc 

agatttaaatggagaaccgataaaagaacttgaagggattagttattcatttataaatgtattttcaaaaattgga 

tcttctgctattatttcatttgacctatcaaacgaagcttccaagaaatacaaaatcataaaattagaatttttaa 

gtccagataaaggcaattttattaaccagctaagcagccttactagtggaaaacagcaatcaaaaaaagagcttgc 

aaaagacgcttactcatttggtacattaagaactgaatctctttcaaaaacaattgcagaatattaca;^ 

aactggtattatattttagcagcaataacagtagaaaataatataaataaagaaactgaaaaatacgaaattagaa 

ttaaccctaaaatatataatgattttcaaaaaaaattgagattacattttaaaagcaaccaaataaaaaaatttcc 

aatacccattatagaataa 

f799.aa 

MKKiilllGIIFVAILLFFKILLIPRIQNHEmKNNIKMIISYKQD 
lESNLLYISSKNFTTYANIIYQNESLLSIILKSNGNNNVFYSKRIKPRGKI 

t799.aa 

HE^^SrKlmIKMIISYKQDKNRLSLKINIKTKKTTNLGKAKLDIYLDSKLIESNLLYIS^ 
I IliKSNGNNNVFYSKRI KPRGKI 
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f799.nt 

atgaaaaaacatatcattattgggataatctttgttgcaattcttttattttttaaaattttattaattcccagaa 
ttcaaaatcacgaaaataataaaaataatatcaaaatgataataagctacaagcaagacaaaaacagattatcgct 
aaagataaacataaaaacaaaaaaaactaccaacctgggaaaagccaaactagatatttatctagacagtaaatta 
attgaaagcaatttgctttatataagcagcaaaaactttacaacatatgctaatataatctatcaaaatgaaagtt 
tattaagtataatattaaagagtaatggcaataataatgtcttttatagtaaaagaataaaacctagaggtaa;^ 

ATGA 
t799 .nt 

cacgaaaataataaaaataatatcaaaatgataataagctacaagcaagacaaaaacagattatcgctaaagataa 
acataaaaacaaaaaaaactaccaacctgggaaaagccaaactagatatttatctagacagtaaattaattgaaag 
caatttgctttatataagcagcaaaaactttacaacatatgctaatataatctatcaaaatgaaagtttattaagt 
ataatattaaagagtaatggcaataataatgtcttttatagtaaaagaataaaacctagaggtaaaatatga 

f 800 .aa 

mkkhykalilsllfaiiscntktlnelgeeqfkipfgtlpgaimplnnkftnskfdiktynglvyiaeiktnklmi 
fnsygkliqtyqngifktnpdlkikkidfegiqaiyplkdfiivadklnnkkskfnqkeniayfmrililnknssv 
eilgqeglngmpfpqiydvnvdengniaiisiysegyiiysynkefsplykiyvnknllktidnqkkkynisidkv 
ffewkktlyvkttyyenigdnenindlgikikdqyiykmslkknkelevinkialpknllddkqesfiniikiqk 
dkiiastnmknlsl^liwkldskgsikeqialieppnljyrflseslskdgilsilyggktgvsvyw^ 

t800 .aa 

KTLNELGEEQFKIPFGTLPGAIMPLNNKFTNSKFDIKTYNGLVYIAEIKTNKLMIFNSYGKLIQTYQNGIFKTNPD 

LKIKKIDFEGIQAIYPLKDFIIVADKLNNKKSKFNQKENIAYFl^ILIIJSrKNSSVEILGQEGLNGMPFPQTY^ 

DENGNIAIISIYSEGYIIYSYNKEFSPLYKIYWKNLLKTIDNQKKKYNISIDKVFFEVNKKTLYVKTTYYENIGD 

NENINDLGIKIKDQYIYKMSLKKNKELEVINKIALPKNLLDDKQESFINIIKIQKDKIIASTNMKNLSNNLIWKLD 

SKGSIKEQIALIEPPNLMFLSESLSKDGILSILYGGKTGVSVYWWNLNALIiKL 

f 800 .nt 

ATGAAAAAACACTATAAAGCTCTTATATTAAGCTTGCTTTTTGCAATTATATCATGTAATACTAAAACTTTAAACG 

AATTAGGAGAAGAACAATTTAAAATACCATTTGGAACACTTCCTGGTGCAATAATGCCTCTGAATAACAAATTTAC 

AAATTCAAAATTTGACATCAAAACGTATAACGGGCTAGTGTACATTGCAGAAATAAAAACAAATAAATTAATGATT 

TTCAACTCATACGGAAAACTAATACAAACATATCAAAATGGAATATTTAAAACAAACCCCGATTTAAAAATAAAA^ 

AAATAGATTTTGAAGGAATTCAAGCAATATACCCACTAAAAGATTTTATTATTGTCGCAGACAAACTAAATAATAA 

AAAATCAAAATTCAACCAAAAAGAGAATATTGCCTACTTCATGAGAATACTAATACTAAACAAAAACTCATCTGTA 

GAAATTTTGGGTCAAGAAGGTTTAAACGGAATGCCATTTCCACAAATTTATGATGTTAATGTTGATGAA^ 

ACATTGCAATAATATCAATATATAGCGAAGGATATATAATATATTCTTACAATAAAGAATTTTCCCCGCTTTATAA 

AATTTACGTCAACAAAAACCTGTTAAAAACAATAGACAATCAAAAGAAAAAATACAACATTTCAATAGATAAGGTT 

TTTTTTGAAGTCAACAAAAAAACTCTTTATGTAAAAACTACTTACTATGAAAACATTGGTGACAATGAAAATATAA 

ACGATCTTGGAATTAAAATTAAAGATCAATATATCTATAAAATGAGTTTGAAAAAAAACAAAGAATTAGAAGTGAT 

AAATAAAATTGCTCTTCCTAAAAACTTACTAGATGATAAACAAGAAAGCTTTATAAACATTATAAAAATACAAAAA 

GACAAAATAATAGCATCTACTAATATGAAAAATTTATCTAACAATTTAATATGGAAATTAGACAGCAAGGGCTCAA 

TTAAAGAACAAATAGCTTTAATTGAGCCTCCAAATTTAATGTTTCTCTCTGAGAGTTTATCTAAAGATGGAATACT 

TAGTATACTTTATGGCGGAAAAACTGGTGTTAGTGTTTACTGGTGGAATTTAAATGCATTATTAAAATTATAA 

tSOO .nt 

AAAACTTTAAACGAATTAGGAGAAGAACAATTTAAAATACCATTTGGAACACTTCCTGGTGCAATAATGCCTC 
ATAACAAATTTACAAATTCAAAATTTGACATCAAAACGTATAACGGGCTAGTGTACATTGCAGAAATAAAAACAAA 
TAAATTAATGATTTTCAACTCATACGGAAAACTAATACAAACATATCAAAATGGAATATTTAAAACAAACCCCGAT 
TTAAAAATAAAAAAAATAGATTTTGAAGGAATTCAAGCAATATACCCACTAAAAGATTTTATTATTGTCGCAGAC 
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AACTAAATAATAAAAAATCAAAATTCAACCAAAAAGAGAATATTGCCTACTTCATGAGAATACTAATACTAAACA^ 
AAACTCATCTGTAGAAATTTTGGGTCAAGAAGGTTTAAACGGAATGCCATTTCCACAAATTTATGATGTTAATGTT 
GATGAAAATGGCAACATTGCAATAATATCAATATATAGCGAAGGATATATAATATATTCTTACAATAAAGAATTTT 
CCCCGCTTTATAAAATTTACGTCAACAAAAACCTGTTAAAAACAATAGACAATCAAAAGAAAAAATACAACATTTC 
AATAGATAAGGTTTTTTTTGAAGTCAACAAAAAAACTCTTTATGTAAAAACTACTTACTATGAAAACATTGGTGAC 
AATGAAAATATAAACGATCTTGGAATTAAAATTAAAGATCAATATATCTATAAAATGAGTTTGAAAAAAAACAAAG 
AATTAGAAGTGATAAATAAAATTGCTCTTCCTAAAAACTTACTAGATGATAAACAAGAAAGCTTTATAAACATTAT 
AAAAATACAAAAAGACAAAATAATAGCATCTACTAATATGAAAAATTTATCTAACAATTTAATATGGAAATTAGAC 
AGCAAGGGCTCAATTAAAGAACAAATAGCTTTAATTGAGCCTCCAAATTTAATGTTTCTCTCTGAGAGTTTATCTA 
AAGATGGAATACTTAGTATACTTTATGGCGGAAAAACTGGTGTTAGTGTTTACTGGTGGAATTTAAATGCATTATT 
AAAATTATAA 

fSlO.aa 

lynfKLFLFFIIFMFLSCDEKKSSKNLKSVKIGYVNWGGETAATIWLKWFEKMGra 

GTVSSWPTADKFYYEKLKTKFVDIXSANYEGTIQGFWPSYVPISSISELKGKGDKFKNKMIGIDAGAGTQIVTEQ 
ALNYYGLSKEYELVPSSESVMLASLDSSIKRNEWILVPLWKPHWAFSRYDIKFLDDPDLIMGGIESVHTLVRLGLE 
NDDFDAYYVFDHFYWSDDLILPLMDKNDKEPGKEYRNAVEFVEKNKEIVKTWVPEKYKTLFD 

tSlO ,aa 

CDEKKSSKNLKSVKIGYVNWGGETAATWLKWFEKMGYNAEIFSVTTSIMYQYIJISGKIDGTVSSWVPTADKFYY 
EKLKTKFVDLGA]S^YEGTIQGFWPSYVPISSISELKGKGDKFKNKMIGIDAGAGTQIVTEQAL^^YYGLSK^^ 
SSESVMLASLDSSIKRNEWILVPLWKPHWAFSRYDIKFLDDPDLIMGGIESVHTLVRLGLENDDFDAYYVFDHFYW 
SDDLILPLMDKNDKEPGKEYRNAVEFVEKNKEIVKTWVPEKYKTLFD 

fSlO.nt 

ATGTATAAATTATTTTTATTTTTTATTATTTTTATGTTTTTGTCTTGTGATGAAAAAAAGAGTTCAAAGAATTTAA 
AATCGGTAAAAATTGGATATGTGAATTGGGGTGGAGAAACGGCAGCTACAAATGTATTAAAGGTTGTTTTTGAGAA 
AATGGGCTACAATGCAGAAATATTTTCAGTTACTACGTCTATAATGTATCAATACTTAGCATCTGGAAAGATAGAC 
GGTACGGTGTCTTCTTGGGTTCCTACAGCCGATAAATTTTATTATGAAAAACTGAAAACAAAGTTTGTTGATCTTG 
GTGCAAATTATGAAGGAACCATTCAAGGTTTTGTGGTGCCAAGCTATGTTCCAATTTCCAGCATTAGTGAGCTTAA 
GGGTAAAGGTGATAAGTTTAAAAACAAAATGATTGGCATAGATGCTGGTGCGGGAACTCAAATTGTTACAGAACAA 
GCGCTTAATTATTATGGATTAAGTAAAGAGTATGAGCTAGTTCCTTCAAGTGAGAGTGTTATGCTTGCAAGTTTAG 
ATTCTTCAATAAAGAGAAACGAATGGATTTTAGTTCCTTTGTGGAAGCCTCATTGGGCTTTTTCTAGGTATGATAT 
TAAGTTTCTTGATGATCCTGATTTAATTATGGGGGGAATTGAGAGCGTGCATACTCTTGTTAGACTTGGTCTTGAA 
AATGATGATTTTGATGCATATTATGTTTTTGATCATTTTTATTGGAGCGATGATTTAATATTGCCCTTAATGGATA 
AAAATGATAAAGAGCCAGGCAAAGAATACCGCAATGCGGTTGAATTTGTTGAAAAGAATAAAGAGATTGTAAAGAC 
GTGGGTTCCAGAAAAATATAAGACCTTATTTGATTAA 

tSlO .nt 

TGTGATGT^AAAAAAGAGTTCAAAGAATTTAAAATCGGTAAAAATTGGATATGTGAATTGGGGTGGAGAAACGGCAG 

CTACTU^TGTATTAAAGGTTGTTTTTGAGAAAATGGGCTACAATGCAGAAATATTTTCAGTTACTACGTCTATAAT 

GTATCAATACTTAGCATCTGGAAAGATAGACGGTACGGTGTCTTCTTGGGTTCCTACAGCCGATAAATTTTATTAT 

GAAAAACTGAAAACAAAGTTTGTTGATCTTGGTGCAAATTATGAAGGAACCATTCAAGGTTTTGTGGTGCCAAGCT 

ATGTTCCAATTTCCAGCATTAGTGAGCTTAAGGGTAAAGGTGATAAGTTTAAAAACAAAATGATTGGCATAGATC 

TGGTGCGGGAACTCAAATTGTTACAGAACAAGCGCTTAATTATTATGGATTAAGTAAAGAGTATGAGCTAGTTCCT 

TCAAGTGAGAGTGTTATGCTTGCAAGTTTAGATTCTTCAATAAAGAGAAACGAATGGATTTTAGTTCCTTT^ 

AGCCTCATTGGGCTTTTTCTAGGTATGATATTAAGTTTCTTGATGATCCTGATTTAATTATGGGGGGAATTGAGAG 

CGTGCATACTCTTGTTAGACTTGGTCTTGTU^AATGATGATTOTGATGCATATTATGTTTTTGATCATTTTTAT^^ 

AGCGATGATTTAATATTGCCCTTAATGGATAAAAATGATAAAGAGCCAGGCAAAGAATACCGCAATGCGGTTG^ 

TTGTTGAAAAGAATAAAGAGATTGTAAAGACGTGGGTTCCAGAAAAATATAAGACCTTATTTGATTAA 
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MLVKRIVGKPITMLILFSLLLMISLYTFSRLKVDLLPGIDIPQISIHTVYPGASPREVEESVSRVLESGLSSVKNL 

KNIYSVSSKESSTVSLEFYHGTDLDLVLNEIRDALELVKSSLPSKSQTPRIFRYNLKNIPVMEIVINSVRPVSELK 

RYADEIIKPGLERLIX5VAIVTVNGGSKKRVLIEVSQNRLESYGLSLSRISSIIASQNLELSAGNILENNLEYLVEV 

SGKFKSIEEIGNWIAYKIPDISSGINLSPIEIKLKDIANIKTDFEDLSEYVEYNGLPSISLSVQKRSDSNSIAVS 

NVVMNEIEKLKLSMPKDMKLEIASDSTDFIKASISTVVNSAYFGAMLAIFVIFFFLRSFRATIIIGIS 

FCLMYFWISLNIMSIAGLALGIG1WVIX:SIWIDNIYKYRQKGAKLISSSILGAQEMM 

LIFKSELGVYGDFFKDFTFTIVISLGVSLLVAIFLiVPVLSSHYVGLYTSFQKNIKNAFIRKIDAFFASIYYFLEFL 

YINLLNIVLNHKLIFGLIVFFSFIGSLLLGLLLDVTTFTRGKENSITINLNFPHKTNLEYAKFYSNRFLEIVKSEA 

KGYKSIIATLRADRITFNVLFPLKEESRDNLTQSVDYDSIKYKIMNRIGNLYPEFNIEPSISGNALGGGDSIKIKI 

SANDFEYIKDYGKILVSMLKKEIPELVNPRLSISDFQLQIGVEIDRALVYlSr/GIDMNTILNELKANINGW 

EKGLNYDIVLKLDRMDVKNLKDLEKIFITNSSGVKIPFSSIATFEKTNKAESIYRENQALTIYLNAGISPDDNLTQ 

VTAKWDFINNKVPHKEGITLKVEGEYNEFSNimQFKIIIMMAIIW 

VLIHFLAGEKLSIFAAIGMLMLVGVVVNTGIVLVDYTGLLIKRGFGLREAIIESCRSRLRPILMSSLTSIIGLIPM 
AFSSGSGNELLKPIAFTFIGGMTASTFLTLFFIPMLFEIFPTCFKFQI 

t814 .aa 

RLKVDLLPGIDIPQISIHTVYPGASPREVEESVSRVLESGLSSVKNLKNIYSVSSKESSTVSLEFYHGTDLDLVLN 

EIRDALELVKSSLPSKSQTPRIFRYNLKNIPVMEIVINSVRPVSELKRYADEIIKPGLERLDGVAIVTVNGGSKKR 

VLIEVSQNRLESYGLSLSRISSIIASQNLELSAGNILENNLEYLVEVSGKFKSIEEIGNWIAYKIPDISSGINLS 

PIEIKLKDIANIKTDFEDLSEYVEYNGLPSISLSVQKRSDSNSIAVSNVViynsrEIEKLKLSMPKDMKLEIASDSTDF 

IKASISTVVNSAYFGAMU^IFVIFFFLRSFRATIIIGISIPIAIVLTFCLMYFWISLNIMSLAGIALGIGMVV^ 

SIWIDNIYKYRQKGAKLISSSILGAQEMMLPITSSTFTSICVFGPFLIFKSELGVYGDFFKDFTFTIVISLGVSL 

LVAIFLVPVLSSHYVGLYTSFQKNIKNAFIRKIDAFFASIYYFLEFLYINLLNIVLNHKLIFGLIVFFSFIGSLLL 

GLLLDVTTFTRGKENSITINLNFPHKTNLEYAKFYSNRFLEIVKSEAKGYKSIIATLRADRITFNVLFPLKEESRD 

NLTQSVDYDSIKYKIMNRIGNLYPEFNIEPSISGNALGGGDSIKIKISANDFEYIKDYGKILVSMLKKEIPELVNP 

RLSISDFQLQIGVEIDRALVYNYGIDMNTILNELKANINGWAGQYVEKGLNYDIVLKLDRMDVKNLKDLE 

NSSGVKIPFSSIATFEKTNKAESIYRENQALTIYLNAGISPDDNLTQVTAKVVDFINNKVPHKEGITLKVEGEYNE 

FSNIMNQFKIIIMMAIIWFGIMASQFESFLKPFIIIFTIPLTAIGWLIHFLAGEKLSIFAAIGMLMLVGVVV^ 

GIVLVDYTGLLIKRGFGLREAIIESCRSRLRPILMSSLTSIIGLIPMAFSSGSGNELLKPIAFTFIGGMTASTFLT 

LFF I PMLFEI F PTCFKFQI 

f814.nt 

ATGTTGGTAAAGAGAATAGTTGGCAAACCAATAACAATGTTGATTTTATTTTCATTGTTATTGATGATAAGTTTGT 

ATACCTTTTCAAGATTAAAAGTAGATCTTTTGCCGGGAATTGACATTCCCCAAATAAGTATTCACACTGTTTATCC 

TGGCGCTTCTCCTAGAGAAGTTGAAGAGAGTGTTTCTAGAGTCCTTGAGAGTGGCTTGAGTTCGGTAAAGAATTTA 

AAAAATATATATAGTGTATCTTCCAAAGAAAGCAGCACCGTTTCACTTGAATTTTATCATGGAACCGATTTAGATT 

TGGTTTTAAATGAAATTCGAGATGCTCTTGAATTGGTAAAATCTTCATTGCCCAGCAAATCACAGACCCCAAGAAT 

TTTTAGATACAATCTTAAAAACATCCCTGTAATGGAAATTGTTATTAATTCTGTAAGGCCAGTTTCTGAGCTTAAA 

AGATATGCCGATGAAATCATTAAACCTGGGCTTGAAAGGCTTGATGGAGTTGCAATTGTTACTGTTAATGGTGGAA 

GTAAAAAGCGTGTTTTAATTGAAGTTTCTCAAAACAGGCTGGAGTCTTATGGGCTTTCTTTGTCAAGAATATCTTC 

AATTATAGCATCCCAAAATTTGGAACTTTCAGCTGGCAATATATTGGAGAACAACTTGGAATATTTGGTTGAAGTT 

TCTGGAAAATTTAAATCAATTGAAGAGATAGGTAATGTGGTCATAGCTTATAAGATACCCGACATTTCTTCTGGCA 

TAAATTTATCTCCTATTGAGATAAAACTCAAAGATATTGCTAATATTAAAACCGATTTTGAAGATTTGTCTGAATA 

TGTTGAATATAATGGGTTGCCTTCAATTTCTTTGTCGGTTCAAAAACGTAGTGATTCTAATTCTATTGCAGTTTCT 

AATGTTGTTATGAATGAAATAGAAAAATTGAAATTATCTATGCCTAAAGATATGAAATTGGAGATTGCTTCTGATA 

GTACTGATTTTATTAAAGCATCCATTTCAACGGTTGTAAATTCAGCCTATTTTGGGGCCATGCTTGCAATATTTGT 

TATTTTTTTCTTTTTAAGAAGCTTTAGGGCCACAATAATTATTGGAATTTCTATTCCAATAGCAATTGTTTTGAC 

TTTTGTTTAATGTATTTTGTAAATATTTCTCTTAATATTATGAGTCTTGCGGGTCTTGCACTT^^ 

TTGTTGACTGTTCAATTGTTGTAATAGACAATATATACAAATATAGGCAAAAAGGAGCAAAGCTTATT^ 

TATTCTCGGAGCTCAGGAGATGATGTTGCCTATTACATCTTCAACTTTTACTTCTATTTGTGTTTTTGGTCCATTT 

CTTATTTTCAAATCAGAACTTGGGGTATATGGAGATTTTTTCAAAGACTTTACATTTACGATTGTT 

GTGTTTCTCTTTTAGTTGCAATTTTTTTGGTTCCTGTTTTATCAAGCCACTATC 

AAAGAATATTAAGAATGCTTTTATTAGGAAAATCGATGCCTTTTTTGCTAGTATTTATT^ 
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TATATCAATTTATTAAATATAGTTTTAAATCACAAATTGATTTTTGGGTTGATTGTTTTTTTTAGTT 

GCTTGCTTTTAGGATTATTGTTAGATGTGACAACTTTTACTAGAGGGAAAGAGAACTCAATTACTATTAATTTAAA 

TTTTCCCCACAAAACTAATTTGGAATATGCAAAATTTTATTCTAATAGATTTTTAGAAATTGTAAAAAGTGAGG 

AAAGGATATAAAAGTATTATTGCTACTTTGCGTGCTGATAGAATAACTTTCAACGTATTGTTTCCTCTCAAAGAAG 

AATCAAGAGATAATTTAACCCAAAGCGTAGATTACGATTCTATTAAATATAAAATTATGAATCGTATTGGTAATCT 

TTATCCTGAATTTAATATTGAGCCTTCCATTAGTGGCAATGCTTTAGGTGGTGGAGATTCTATTAAAA 

TCGGCCAATGATTTTGAATATATAAAAGATTATGGAAAAATTTTAGTTTCCATGTTAAAAAAGGAAATTC 

TTGTAT^TCCAAGGCTTAGCATAAGTGATTTTCAGCTTCAAATTGGCGTTGAGATAGACAGAGCGCTAGTTTATAA 

TTATGGTATTGACATGAATACCATTTTAAATGAGTTGAAGGCCAATATTAATGGTGTTGTTGCTGGGCAATATGTG 

GAGAAGGGACTTAATTATGATATTGTTCTTAAGCTTGATAGAATGGATGTTAAAAATTTAAAAGATTTAGAAAAAA 

TATTTATTACAAATTCATCTGGAGTTAAAATTCCTTTTTCATCAATAGCCACCTTTGAAAAAACCAATAAAGCCGA 

ATCTATTTACAGAGAAAATCAAGCTTTAACCATTTATCTTAATGCGGGTATTTCTCCAGATGATAATTTAACCCAA 

GTAACCGCAAAAGTTGTAGATTTTATTAATAATAAGGTGCCCCATAAAGAAGGCATAACTCTTAAGGTTGAAGGAG 

AATATAATGAATTTTCAAATATCATGAATCAGTTTAAAATAATCATTATGATGGCTATTATTGTTGTGTTTGGTAT 

TATGGCTTCTCAATTTGAATCTTTTTTAAAACCCTTTATTATTATTTTTACAATTCCTTTAACGGCAATAGGGGTT 

GTGCTTATACATTTTCTTGCAGGAGAAAAGCTTTCTATTTTTGCTGCAATTGGTATGCTTATGCTTGTTGGTGTTG 

TGGTAAATACAGGAATTGTTCTTGTAGACTATACTGGTTTATTGATCAAGAGGGGATTTGGCCTAAGAGAAGCAAT 

TATTGAATCTTGTCGTTCAAGGCTTAGGCCAATTTTAATGTCTTCTTTGACCTCAATAATAGGGCTTATTCCAATG 

GCATTTTCTAGCGGAAGTGGAAATGAACTTCTAAAACCAATTGCATTTACTTTTATTGGCGGAATGACAGCTAGCA 

CATTTCTTACTTTGTTTTTTATTCCCATGCTTTTTGAAATTTTTCCAACATGTTTCAAGTTTCAAATCTAG 

t814 .nt 

AGATTAAAAGTAGATCTTTTGCCGGGAATTGACATTCCCCAAATAAGTATTCACACTGTTTATCCTGGCGCTTCTC 

CTAGAGAAGTTGAAGAGAGTGTTTCTAGAGTCCTTGAGAGTGGCTTGAGTTCGGTAAAGAATTTAAAAAATATATA 

TAGTGTATCTTCCAAAGAAAGCAGCACCGTTTCACTTGAATTTTATCATGGAACCGATTTAGATTTGGTTTTAAAT 

GAAATTCGAGATGCTCTTGAATTGGTAAAATCTTCATTGCCCAGCAAATCACAGACCCCAAGAATTTTTAGATACA 

ATCTTT^^AAACATCCCTGTAATGGAAATTGTTATTAATTCTGTAAGGCCAGTTTCTGAGCTTAAAAGATATGCCGA 

TGAAATCATTAAACCTGGGCTTGAAAGGCTTGATGGAGTTGCAATTGTTACTGTTAATGGTGGAAGTAAAAAGCGT 

GTTTTAATTGAAGTTTCTCAAAACAGGCTGGAGTCTTATGGGCTTTCTTTGTCAAGAATATCTTCAATTATAGCAT 

CCCAAAATTTGGAACTTTCAGCTGGCAATATATTGGAGAACAACTTGGAATATTTGGTTGAAGTTTCTGGAAAATT 

TAAATCAATTGAAGAGATAGGTAATGTGGTCATAGCTTATAAGATACCCGACATTTCTTCTGGCATAAATTTATCT 

CCTATTGAGATAAAACTCAAAGATATTGCTAATATTAAAACCGATTTTGAAGATTTGTCTGAATATGTTGAATATA 

ATGGGTTGCCTTCAATTTCTTTGTCGGTTCAAAAACGTAGTGATTCTAATTCTATTGCAGTTTCTAATGTTGTTAT 

GAATGAAATAGAAAAATTGAAATTATCTATGCCTAAAGATATGAAATTGGAGATTGCTTCTGATAGTACTGATTTT 

ATTAAAGCATCCATTTCAACGGTTGTAAATTCAGCCTATTTTGGGGCCATGCTTGCAATATTTGTTATTTTTTTCT 

TTTTAAGAAGCTTTAGGGCCACAATAATTATTGGAATTTCTATTCCAATAGCAATTGTTTTGACCTTTTGTTTAAT 

GTATTTTGTAAATATTTCTCTTAATATTATGAGTCTTGCGGGTCTTGCACTTGGGATTGGAATGGTTGTTGACTGT 

TCAATTGTTGTAATAGACAATATATACAAATATAGGCAAAAAGGAGCAAAGCTTATTTCGTCTTCTATTCTCGGAG 

CTCAGGAGATGATGTTGCCTATTACATCTTCAACTTTTACTTCTATTTGTGTTTTTGGTCCATTTCTTATTTTCAA 

ATCAGAACTTGGGGTATATGGAGATTTTTTCAAAGACTTTACATTTACGATTGTTATTTCCTTGGGTGTT^ 

TTAGTTGCAATTTTTTTGGTTCCTGTTTTATCAAGCCACTATGTCGGTTTATACACAAGTTTCCAAAAGAATATTA 

AGAATGCTTTTATTAGGAAAATCGATGCCTTTTTTGCTAGTATTTATTATTTTTTAGAGTTTTTGTATATCAATTT 

ATTAAATATAGTTTTAAATCACAAATTGATTTTTGGGTTGATTGTTTTTTTTAGTTTTATTGGCAGCTTGCTTTTA 

GGATTATTGTTAGATGTGACAACTTTTACTAGAGGGAAAGAGAACTCAATTACTATTAATTTAAATTTTCCCCACA 

AAACTAATTTGGAATATGCAAAATTTTATTCTAATAGATTTTTAGAAATTGTAAAAAGTGAGGCTAAAGGAT^ 

AAGTATTATTGCTACTTTGCGTGCTGATAGAATAACTTTCAACGTATTCTTTCCTCTCAAAGAAGAATCAAGAGAT 

AATTTAACCCAAAGCGTAGATTACGATTCTATTAAATATAAAATTATGAATCGTATTGGTAATCTTTATCCTGAAT 

TTAATATTGAGCCTTCCATTAGTGGCAATGCTTTAGGTGGTGGAGATTCTATTAAAATTAAAATTTCGGCCAATGA 

TTTTGAATATATAAAAGATTATGGAAAAATTTTAGTTTCCATGTTAAAAAAGGAAA^ 

aggcttagcataagtgattttcagcttcaaattggcgttgagatagacagagcgctagtttataattatggtattc 

acatgaataccattttaaatgagttgaaggccaatattaatggtgttgttgctgggcaatatgtggagaa 

taattatgatattgttcttaagcttgatagaatggatgttaaaaatttaaaagatttagaaaaaatat^ 

aattcatctggagttaaaattcctttttcatcaatagccacctttgaaaaaaccaataaagccg^ 

gagaaaatcaagctttaaccatttatcttaatgcgggtatttctccagatgataatttaacccaagtaaccgca;^ 

agttgtagattttattaataataaggtgccccataaagaaggcataactcttaaggttgaaggagaatataatc 
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TTTTCAAATATCATGAATCAGTTTAAAATAATCATTATGATGGCTATTATTGTTGTGTTTGGTATTATGGCTTCTC 
AATTTGAATCTTTTTTAAAACCCTTTATTATTATTTTTACAATTCCTTTAACGGCAATAGGGGTTGTGCTTATAC^ 
TTTTCTTGCAGGAGAAAAGCTTTCTATTTTTGCTGCAATTGGTATGCTTATGCTTGT^ 

GGAATTGTTCTTGTAGACTATACTGGTTTATTGATCAAGAGGGGATTTGGCCTAAGAGAAGCAATTATTGAATCTT 
GTCGTTCAAGGCTTAGGCCAATTTTAATGTCTTCTTTGACCTCAATAATAGGGCTTATTCCAATGGCATTTTCTAG 
CGGAAGTGGAAATGAACTTCTAAAACCAATTGCATTTACTTTTATTGGCGGTU^TGACAGCTAGCACATTTCTTACT 
TTGTTTTTTATTCCCATGCTTTTTGAAATTTTTCCAACATGTTTCAAGTTTCAAATCTAG 

£818. aa 

MLKNHSKLIIQLKVVMMIYLKKMGNDMTKFYNYRIEIVSNLSLELDVFECIEKIEQELGESIYYSKIGNW 
GEKHGNGVWPEENF I L 1 1 YT SNQS I VERLKD I VDDLNRS YPTEG INLFVLRNS 

t818 .aa 

KKMGNDMTKFYNYRI E I VSNLSLELDVFEC I EKI EQELGES I YYSKIGNVYGKGKKGEKHGNGVWPEENF I L I I YT 
SNQSIVERLKDIVDDLNRSYPTEGINLFVLRNS 

fSlS.nt 

ATGTTGAAAAATCATTCAAAATTAATAATTCAACTAAAAGTAGTTATGATGATTTATTTGAAGAAGATGGGGAATG 
ATATGACTAAATTTTATAATTATAGGATTGAAATAGTTTCTAACTTATCTTTAGAGCTTGATGTTTTTGAATGTAT 
AGAAAAAATAGAGCAAGAGTTAGGAGAGTCTATATATTATTCTAAGATAGGAAATGTTTATGGAAAAGGTAAGAAG 
GGAGAAAAGCATGGTAATGGCGTTTGGCCTGAAGAAAATTTTATTTTGATTATTTATACCTCCAATCAGTCTATTG 
TTGAGCGATTAAAGGATATTGTGGATGATTTGAATCGTTCTTACCCTACAGAAGGGATTAATCTTTTTGTTTTGAG 
AAATTCTTAA 

t818 -nt 

AAGAAGATGGGGAATGATATGACTAAATTTTATAATTATAGGATTGAAATAGTTTCTAACTTATCTTTAGAGCTTG 
ATGTTTTTGAATGTATAGAAAAAATAGAGCAAGAGTTAGGAGAGTCTATATATTATTCTAAGATAGGAAATGTTTA 
TGGAAAAGGTAAGAAGGGAGAAAAGCATGGTAATGGCGTTTGGCCTGAAGAAAATTTTATTTTGATTATTTATACC 
TCCAATCAGTCTATTGTTGAGCGATTAAAGGATATTGTGGATGATTTGAATCGTTCTTACCCTACAGAAGGGATTA 
ATCTTTTTGTTTTGAGAAATTCTTAA 

f820.aa 

MLNNTYRIKTILTIFLAITLLTIYKYFTLMAFNNSPDNTISLKSNDIAKRGTIYDRNGKPIAFSSKSYSIGTNPQK 

lENIVSTSETLGAILQINSRILKEKLSSNKGFLYIKRKIKREESDLIKRIQAEGRLSNITLYPDYTRIYPFRNTTS 

NITGFVGTDNLGLEGIEFSIJ^SILGKDKTKQQFl^EEPETNNIHLTIDimiQQGVSKIAKiCYFKE^ 

NSQNGEILSMVQFPQYDANFYSKYPEEIRKNIiSSSLTYEPGSINKIFTVAIILESGKLNLEEKFLDNGIYQKQFPS 

GEKITIKTLNPPYKHIDSTEILIYSSNVGIAYITEKVSNEYFYKKLLDFGFGEKVGVPFPGETKGLIJSrHYSKWSGR 

SKATIGFGQEIGVSAVQILQAASILSNNGIMLKPRIIKKISNDKGENIKEFDKEEIRKVISKNSAQKVLKMMREVV 

NKGGIPNLKIKNLDISAiCSGTSQAIDRKTGKYSEEDYTSSIIAIYPTEQPKYIIYIVYRYPKKIIYGTRIAAPl^ 

EIIEFIEHQQNTIAYKKIKMPSKIKIPKAETNYKNKTYLPNFINLSKREAIDILKYYKNTMKIKINGDGFVYKQSI 

SPNTKLEDITELELYLK 

t820.aa 

FNNSPDNTISLKSNDIAKRGTIYDRNGKPIAFSSKSYSIGTNPQKIENIVSTSETLGAILQINSRILKEKLSSNKG 

FLYIKRKIKREESDLIKRIQAEGRLSNITLYPDYTRIYPFRNTTSNITGFVGTDNLGLEGIEFSLNSILGKDKTKQ 

QFLNEEPETKn^IHLTIDMDIQQGVSKIAKKYFKElSn^PESLITLVMNSQNGEILSMVQFPQYDANFYSKYPEEIRK^ 

LSSSLTYEPGSINKIFTVAIILESGKLNLEEKFLDNGIYQKQFPSGEKITIKTLNPPYKHIDSTEILIYSSNVGIA 

YITEKVSNEYFYKKLLDFGFGEKVGVPFPGETKGLLNHYSKWSGRSKATIGFGQEIGVSAVQILQAASILSNn^ 

LKPRI IKKI SNDKGENI KEFDKEEIRKVI SKNS AQK\njKMMREVVNKGG I PNLKIKNI^ 
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YSEEDYTSSILAIYPTEQPK:!fIIYIWRYPKiCIIYGTRIAAPMAKEIIEFIEHQQNTIAyKKIKMPSKIKIPKAET 
ISrnCNKTYLPNF INLSKHEAIDI LKYYKNTMKI KINGDGFVYKQS I 

f820.nt 

ATGCTTAATAACACTTATCGAATAAAAACAATATTAACAATATTCTTGGCTATAACTTTGTTAACTATTTACAAAT 
ATTTCACACTAATGGCCTTCAATAACAGCCCAGACAACACAATATCTTTAAAGTCAAATGATATTGCCAAAAGAGG 
AACAATTTATGATAGAAATGGCAAACCAATAGCATTCTCTTCAAAATCCTACTCAATTGGTACAAATCCTCAAAAA 
ATAGAAAATATTGTAAGCACATCTGAAACTCTTGGTGCAATACTTCAAATTAATTCAAGAATTTTAAAGGAAAAGC 
TTTCCTCTAACAAAGGGTTTTTATATATAAAAAGAAAAATAAAAAGAGAAGAATCAGATTTAATAAAAAGAATTCA 
AGCTGAAGGCAGGCTTTCAAACATCACTTTATATCCTGATTACACAAGAATTTATCCCTTCAGGAATACCACAAGC 
AATATTACTGGTTTTGTAGGAACAGATAATCTTGGCCTTGAGGGCATTGAATTTTCCCTAAATAGCATATTAGGAA 
AAGATAAAACCAAGCAACAATTTTTAAATGAGGAGCCAGAAACAAACAACATCCACTTAACAATAGACATGGATAT 
ACAACAAGGTGTTAGCAAAATAGCTAAAAAATACTTTAAAGAAAATAATCCTGAAAGTTTAATTACCTTGGTAATG 
T^CTCCCAAAATGGAGAAATATTATCCATGGTTCAATTTCCTCAATATGATGCAAACTTTTATTCTAAATATCCTG 
AAGAAATCCGAAAAAACCTTTCTTCATCTCTAACCTATGAGCCCGGAAGCATTAATAAAATTTTTACAGTTGCAAT 
AATATTAGAAAGTGGAAAATTAAATTTAGAAGAAAAATTTTTAGACAATGGAATATATCAAAAACAATTTCCATCA 
GGAGAAAAAATTACAATCAAAACATTAAATCCCCCCTATAAACATATCGACTCTACAGAGATTTTAATTTATTCAT 
CAAATGTTGGAATAGCTTACATTACTGAAAAAGTTAGCAATGAATACTTTTATAAAAAACTTTTAGATTTTGGCTT 
TGGGGAAAAAGTTGGAGTTCCATTTCCCGGAGAAACAAAAGGACTGCTAAATCATTATTCAAAATGGTCAGGACGA 
AGTAAAGCTACAATTGGATTTGGACAAGAAATAGGAGTGTCAGCGGTTCAAATATTACAAGCTGCAAGCATACTAA 
GCAATAATGGAATAATGCTAAAACCTAGAATAATAAAAAAAATAAGCAACGATAAAGGAGAAAATATTAAAGAATT 
TGATAAAGAAGAAATAAGAAAAGTAATATCCAAAAATTCAGCACAAAAAGTTTTAAAAATGATGAGAGAAGTTGTA 
AATAAAGGTGGAATTCCAAATCTTAAAATTAAAAATCTTGACATTTCTGCAAAAAGTGGAACATCTCAAGCTATTG 
ATAGAAAAACGGGAAAATACTCAGAAGAAGACTATACATCTTCTATATTGGCAATATACCCCACAGAACAACCAAA 
ATATATTATTTACATTGTATACAGATACCCAAAAAAAATAATATACGGAACAAGAATAGCAGCCCCAATGGCAAAA 
GAAATAATAGAATTTATTGAGCACCAACAAAATACAATAGCATATAAAAAAATTAAAATGCCATCAAAAATCAAGA 
TCCCTAAAGCTGAAACTAATTACAAAAACAAAACATACTTACCAAATTTTATCAACCTTTCTAAAAGAGAAGCAAT 
AGACATACTAAAATACTATAAAAATACTATGAAAATAAAAATAAATGGCGATGGATTTGTTTACAAGCAAAGTATA 
TCCCCCAATACAAAATTAGAAGATATAACAGAGCTTGAACTGTATTTAAAATAA 

t820.nt 



TTCAATAACAGCCCAGACAACACAATATCTTTAAAGTCAAATGATATTGCCAAAAGAGGAACAATTTATGATAGAA 

ATGGCAAACCAATAGCATTCTCTTCAAAATCCTACTCAATTGGTACAAATCCTCAAAAAATAGAAAATATTGTAAG 

CACATCTGAAACTCTTGGTGCAATACTTCAAATTAATTCAAGAATTTTAAAGGAAAAGCTTTCCTCTAACAAAGGG 

TTTTTATATATAAAAAGAAAAATAAAAAGAGAAGAATCAGATTTAATAAAAAGAATTCAAGCTGAAGGCAGGCTTT 

CAAACATCACTTTATATCCTGATTACACAAGAATTTATCCCTTCAGGAATACCACAAGCAATATTACTGGTTTTGT 

AGGAACAGATAATCTTGGCCTTGAGGGCATTGAATTTTCCCTAAATAGCATATTAGGAAAAGATAAAACCAAGCAA 

CAATTTTTAAATGAGGAGCCAGAAACAAACAACATCCACTTAACAATAGACATGGATATACAACAAGGTGTTAGCA 

AAATAGCTAAAAAATACTTTAAAGAAAATAATCCTGAAAGTTTAATTACCTTGGTAATGAACTCCCAAAATGGAGA 

AATATTATCCATGGTTCAATTTCCTCAATATGATGCAAACTTTTATTCTAAATATCCTGAAGAAATCCGAAAAA^ 

CTTTCTTCATCTCTAACCTATGAGCCCGGAAGCATTAATAAAATTTTTACAGTTGCAATAATATTAGAAAGTGGAA 

AATTAAATTTAGAAGAAAAATTTTTAGACAATGGAATATATCAAAAACAATTTCCATCAGGAGAAAAAATTACAAT 

CAAAACATTAAATCCCCCCTATAAACATATCGACTCTACAGAGATTTTAATTTATTCATCAAATGTTGGAATAGCT 

TACATTACTGAAAAAGTTAGCAATGAATACTTTTATAAAAAACTTTTAGATTTTGGCTTTGGGGAAAAAGTTGG^^ 

TTCCATTTCCCGGAGAAACAAAAGGACTGCTAAATCATTATTCAAAATGGTCAGGACGAAGTAAAGCTACAATTGG 

ATTTGGACAAGAAATAGGAGTGTCAGCGGTTCAAATATTACAAGCTGCAAGCATACTAAGCAATAATGGAATAATG 

CTAAAACCTAGAATAATAAAAAAAATAAGCAACGATAAAGGAGAAAATATTAAAGAATTTGATAAAGAAGAAATAA 

GAAAAGTAATATCCAAAAATTCAGCACAAAAAGTTTTAAAAATGATGAGAGAAGTTGTAAATAAAGGTGGAAT 

AAATCTTAAAATTAAAAATCTTGACATTTCTGCAAAAAGTGGAACATCTCAAGCTATTGATAGAA 

TACTCAGAAGAAGACTATACATCTTCTATATTGGCAATATACCCCACAGAACAACCAAAATATATTATTTACATTG 

TATACAGATACCCAAAAAAAATAATATACGGAACAAGAATAGCAGCCCCAATGGCAAAAGAAATAATAGAATTTAT 

TGAGCACCAACAAAATACAATAGCATATAAAAAAATTAAAATGCCATCAAAAATCAAGATCCCTAAAGCTGAAAC 

AATTACAAAAACAAAACATACTTACCAAATTTTATCAACCTTTCTAAAAGAGAAGCAATAGACATACTAAAATACT 
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ATAAAAATACTATGAAAATAAAAATAAATGGCGATGGATTTGTTTACAAGCAAAGTATATCCCCCAATACAAAATT 
AGAAGATATAACAGAGCTTGAACTGTATTTAAAATAA 

f831.aa 

MAKNNLLVFFIAI IFVFVSI IWFYNSLGKDYVKSGGEI VENLEKDLNDYLKEl^ 

SSYFISRFYIJ^RAVYFQSQAQYDEAIKDLDIVIKAKGIESEIAFLNKAAVYEKMGLKEDALLWEDLINSTSL^ 
KVRALLS KAI L I EEKDKELAVK VYEE I VKF P YENNL Y INMANNK I LELKQN 

t831 .aa 

YNSrXSKDYVKSGGEIVENLEKDLNDYLKENDAKEREKIFLRIRELISKEKEISSYFISRFYLARAVYFQSQAQYDE 
AIKDLDIVIKAKGIESEIAFLNKAAVYEKMGLKEDALLWEDLINSTSLDFLKVRALLSKAILIEEKDKELAVKVY 
EEIVKFPYENNLYINMANNKILELKQN 

f 831.nt 

ATGGCTAAAAATAATCTTTTAGTTTTCTTTATTGCTATTATTTTTGTGTTTGTGTCTATTATTGTTGTTTTTTATA 
ATTCTTTAGGCAAGGATTATGTAAAGAGTGGCGGAGAAATAGTAGAAAATCTTGAAAAAGATTTAAATGATTATTT 
AAAAGAAAATGATGCCAAAGAGAGAGAAAAAATATTTCTTAGGATAAGGGAGCTTATTTCAAAGGAAAAAGAAATT 
TCATCTTATTTTATTTCAAGGTTCTATTTAGCCAGAGCTGTTTATTTCCAAAGTCAAGCACAGTATGATGAGGCTA 
TTAAAGATTTAGATATTGTTATTAAGGCAAAAGGTATTGAAAGTGAAATTGCTTTTCTTAATAAAGCTGCAGTTTA 
TGAAAAAATGGGATTAAAAGAAGATGCTTTGTTAGTTTATGAAGATCTTATCAATAGTACTAGTTTGGATTTTTTA 
AAGGTAAGAGCTCTTTTGAGTAAGGCAATATTGATTGAGGAAAAAGATAAAGAGCTTGCTGTGAAAGTATACGAAG 
AGATrCTTAAGTTTCCGTATGAAAATAATTTATATATAAATATGGCAAATAATAAAATTTTAGAACTTAAGCAAAA 
TTAA 

t831 .nt 

TATAATTCTTTAGGCAAGGATTATGTAAAGAGTGGCGGAGAAATAGTAGAAAATCTTGAAAAAGATTTAAATGATT 
ATTTAAAAGAAAATGATGCCAAAGAGAGAGAAAAAATATTTCTTAGGATAAGGGAGCTTATTTCAAAGGAAAAAGA 
AATTTCATCTTATTTTATTTCAAGGTTCTATTTAGCCAGAGCTGTTTATTTCCAAAGTCAAGCACAGTATGATGAG 
GCTATTAAAGATTTAGATATTGTTATTAAGGCAAAAGGTATTGAAAGTGAAATTGCTTTTCTTAATAAAGCTGCAG 
TTTATGAAAAAATGGGATTAAAAGAAGATGCTTTGTTAGTTTATGAAGATCTTATCAATAGTACTAGTTTGGATTT 
TTTAAAGGTAAGAGCTCTTTTGAGTAAGGCAATATTGATTGAGGAAAAAGATAAAGAGCTTGCTGTGAAAGTATAC 
GAAGAGATTGTTAAGTTTCCGTATGAAAATAATTTATATATAAATATGGCAAATAATAAAATTTTAGAACTTAAGC 
AAAATTAA 

f843.aa 

MKAIGNAILLr^PLIFSIGISIGVARMGQGTAALGGLIGYLTFNITENYFIEAFSGLVEAETMSSVGR^ 
IJSrrciAGSLAVGLLVGYLHNKFYNMKLPKPFWFSECHFVPIVIILPFCVFLiAIFFCL 

FEYFGSFLYGFLNRLLLPLGLHSILSFPFEFTSLGGVEIVNGDTVRGLKNIFYAQLLDPSLGKFSSGFAKISSGFY 

LSIMFGLPGAALGWKGIVHEDKNKVAALLFSGALTAFLTGITEPLEFLFIFTAPLLYFVHAAYSGFALLI^ 

VTIGNSFSTGFLDFFMFGILQGNSKTNWISVLPLGAMFFALYYFTFSWLYRYFDFQIFVTDDPFFEGQEGKLESLG 

lAHLLIQGI^GFDNITKLDVCSTRLHVDVVNTELVDlSnSILLKEAGVLKIGLVNGKVQLFYGSlSr^ 

SLFEASVMVAVDNVKKGFKTYIEMKEDKKLEKQGKSGKTYKLSELEED 

t843 .aa 

rmgqgtaalggligyltfnitenyfieafsglvea^tmssvgrinffgvqti^tgiagsiavgllvgylhnkfy^ 

klpkpfwfsechfvpiviilpfcvfiaiffcliwssfddliasix3lfvfrfeyfgsflygflnrlllplglhsil 

sfpfeftslggveiwgdtwglknifyaqlldpslgkfssgfakissgfylsimfglpgaalgvykgivhedknk 

vaallfsgaltafltgitepleflfiftapllyfvhaaysgfalllanffnvtignsfstgfij:)Ffmfgilq^ 

tnwisvlplgamffalyyftfswlyryfdfqifvtddpffegqegkleslgiahlliqgliggfdnitk^ 
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HVDVVNTELVDNNLLKEAGVLKIGLVNGKVQLFYGSNVYYIKNAIDTYSPKSLFEASVM^ 
EDKKLEKQGKSGKTYKLSELEED 

£843 .nt 

ATGAAGGCTATAGGCAATGCTATTCTTCTCAATATGCCTTTAATTTTTTCTATTGGAATTTCTATTGGAGTTGCAA 
GAATGGGGCAGGGAACAGCGGCTTTGGGAGGCCTTATTGGTTATTTAACATTTAATATTACTGAAAATTATTTTAT 
TGAGGCTTTTTCAGGGCTTGTTGAAGCAGAGACAATGTCTTCTGTTGGGCGTATAAATTTTTTTGGTGTTCAAACT 
TTAAATACGGGAATTGCAGGTTCTTTAGCGGTAGGCCTTTTAGTTGGATATTTGCATAACAAATTTTATAATATGA 
AGCTACCCAAACCTTTTGTGTTTTTTTCAGAGTGCCATTTTGTGCCTATAGTAATAATTTTACCCTTTTGTGTTTT 
TTTGGCTATATTTTTTTGTTTGATTTGGTCAAGTTTTGACGATTTAATTGCATCTTTAGGTTTGTTTGTTTTTAGG 
TTTGAATATTTTGGCAGTTTTCTTTATGGATTTTTAAATAGGCTTTTATTGCCTTTGGGGTTGCATTCTATTTTAT 
CTTTTCCTTTTGAGTTTACTTCTTTGGGAGGAGTGGAGATAGTTAATGGCGATACTGTTAGAGGTCTTAAGAATAT 
ATTTTATGCTCAGCTATTAGACCCATCACTTGGTAAATTTTCATCAGGCTTTGCCAAAATTAGCAGTGGATTTTAT 
CTATCTATTATGTTTGGACTGCCCGGAGCAGCATTAGGGGTTTACAAGGGTATTGTTCATGAAGATAAAAATAAGG 
TTGCAGCACTTCTTTTCTCTGGGGCCTTGACAGCTTTTTTAACAGGAATAACTGAGCCTTTAGAATTTTTATTTAT 
TTTCACAGCGCCTTTGCTTTATTTTGTTCATGCCGCTTATTCGGGGTTTGCATTGTTGCTTGCTAATTTTTTTAAT 
GTTACGATTGGCAATAGCTTTTCTACTGGATTTTTGGATTTTTTTATGTTTGGGATACTTCAAGGAAATTCTAAGA 
CAAATTGGATTAGTGTATTACCTTTGGGGGCAATGTTTTTTGCTCTTTATTATTTTACTTTTAGTTGGCTTTATAG 
ATACTTTGATTTTCAGATATTTGTTACAGACGATCCATTTTTTGAAGGCCAAGAAGGAAAGCTAGAGAGTCTCGGA 
ATTGCGCATCTTTTAATTCAAGGTCTTGGTGGATTTGATAATATTACAAAGCTTGATGTTTGTTCTACAAGATTGC 
ATGTAGATGTTGTTAATACTGAGCTTGTTGATAATAATTTGCTTAAAGAGGCTGGAGTTCTTAAAATAGGGCTTGT 
TAATGGCAAGGTTCAGCTTTTTTATGGATCTAATGTTTATTATATTAAAAATGCCATTGATACCTATTCTCCAAAG 
AGTCTTTTTGAAGCTAGTGTTATGGTTGCAGTTGATAATGTAAAAAAAGGTTTTAAAACTTATATTGAAATGAAAG 
AAGACAAAAAACTTGAAAAGCAAGGTAAATCAGGAAAAACCTATAAGCTTAGCGAATTAGAAGAAGATTAG 

t843 .nt 

AGAATGGGGCAGGGAACAGCGGCTTTGGGAGGCCTTATTGGTTATTTAACATTTAATATTACTGAAAATTATTTTA 

TTGAGGCTTTTTCAGGGCTTGTTGAAGCAGAGACAATGTCTTCTGTTGGGCGTATAAATTTTTTTGGTGTTCAAAC 

TTTAAATACGGGAATTGCAGGTTCTTTAGCGGTAGGCCTTTTAGTTGGATATTTGCATAACAAATTTTATAATATG 

AAGCTACCCAAACCTTTTGTGTTTTTTTCAGAGTGCCATTTTGTGCCTATAGTAATAATTTTACCCTTTTGTGTTT 

TTTTGGCTATATTTTTTTGTTTGATTTGGTCAAGTTTTGACGATTTAATTGCATCTTTAGGTTTGTTTGTTTTTAG 

GTTTGAATATTTTGGCAGTTTTCTTTATGGATTTTTAAATAGGCTTTTATTGCCTTTGGGGTTGCATTCTATTTTA 

TCTTTTCCTTTTGAGTTTACTTCTTTGGGAGGAGTGGAGATAGTTAATGGCGATACTGTTAGAGGTCTTAAGAATA 

TATTTTATGCTCAGCTATTAGACCCATCACTTGGTAAATTTTCATCAGGCTTTGCCAAAATTAGCAGTGGAT^ 

TCTATCTATTATGTTTGGACTGCCCGGAGCAGCATTAGGGGTTTACAAGGGTATTGTTCATGAAGATAAAAATAAG 

GTTGCAGCACTTCTTTTCTCTGGGGCCTTGACAGCTTTTTTAACAGGAATAACTGAGCCTTTAGAATTTTTATTTA 

TTTTCACAGCGCCTTTGCTTTATTTTGTTCATGCCGCTTATTCGGGGTTTGCATTGTTGCTTGCTAATTTTTTTAA 

TGTTACGATTGGCAATAGCTTTTCTACTGGATTTTTGGATTTTTTTATGTTTGGGATACTTCAAGGAAATTCT 

ACAAATTGGATTAGTGTATTACCTTTGGGGGCAATGTTTTTTGCTCTTTATTATTTTACTTTTAGTTG^ 

GATACTTTGATTTTCAGATATTTGTTACAGACGATCCATTTTTTGAAGGCCAAGAAGGAAAGCTAGAGAGTCTCGG 

AATTGCGCATCTTTTAA1T?CAAGGTCTTGGTGGATTTGATAATATTACAAAGCTTG 

CATGTAGATGTTGTTAATACTGAGCTTGTTGATAATAATTTGCTTAAAGAGGCTGGAGTTCTTAAAATAGGGCTTG 
TTAATGGCAAGGTTCAGCTTTTTTATGGATCTAATGTTTATTATATTAAAAATGCCATTGATACCTATTCTCCAAA 
GAGTCTTTTTGAAGCTAGTGTTATGGTTGCAGTTGATAATGTAAAAAAAGGTTTTAAAACTTATATTGAAATGAAA 
GAAGACAAAAAACTTGAAAAGCAAGGTAAATCAGGAAAAACCTATAAGCTTAGCGAATTAGAAGAAGATTAG 

fSBO.aa 

MRFKKIFLIIFIISNLKVYSYNYAIQYKNEGIDKYYFEILNDGFGFSLSDFFDDLRSGSLIFTYVSKYNFIINLEA 

HMLTYRGYKDSPKSLISRTDLIEIGFMYYFPILLLINGKNFGEIDLGIGVKNLLFGDWGGHLMQSIIHLII^ 

IPSIKSYDSYNYRGFLSFAL^^fSYMNFI^LE]T™DLSYFADYFIKNSIGITLK^ 

YSKTQEAETGIGINYQFYSKNFFITNNU^IKNFSTK^a^LSVGGFGIIITPEEYKKISESI^EF^ 

IMIPLKIRNSLFYKINENIm^YFSISTNY^TNYNETNSFTNQLSSGIIy^/EFLPQKTFNPYLISGLFFAW 

KS I SRPIRIKNI LQVGI ENELGFLFKMLKYRNTEYI FKI YSKVNYI PI AYNLDEKKLEKHS INFNYLGIGI WK 
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tSSO.aa 

YSYNYAIQYKNEGIDKYYFEILNDGFGFSLSDFFDDLRSGSLIFTYVSKYNFIINLEAHMLTYRGYKDSPKSLISR 

TDLlEIGFMYYFPILLLINGKNFGEIDLGIGVKNLLFGDWGGHLMQSIIHLII^QHRPIPSIKSYDSYlSn^ 

ALNYSYMNFLNLENYiymLSYFADYFIKNSIGITLKNENIGFDIKLYSQIQNQIKSLKT^ 

SKNFFITNNLNIKNFSTKENFLSVGGFGIIITPEEYKKISESNNEFNVISNNFYFGFDIMIPLKIRNSLFYKINEN 
INHYFSISTNYYTNYNETNSFTNQLSSGIMYEFLPQKTFNPYLISGLFFAYNQNNKDIKSISRPIRIKNILQVGIE 
NEI^FLFKMLKYRNTEYIFKIYSKVNYIPIAYNLDEKKLEKHSINFNYLGIGIWK 

fSSO.nt 

ATGCGGTTTAAAAAAATATTTTTAATAATATTTATAATTTCTAATTTAAAAGTTTATTCTTATAATTATGCAATCC 
AATATAAAAATGAAGGTATTGACAAATATTATTTTGAAATACTAAATGATGGATTCGGATTTTCATTAAGCGATTT 
TTTTGATGACTTGAGAAGTGGTTCTCTTATTTTTACCTATGTTTCAAAATACAATTTTATAATAAATTTAGAAGCA 
CACATGTTAACCTATAGGGGTTATAAAGACTCTCCGAAATCTTTAATTAGTAGAACAGACTTAATTGAAATAGGCT 
TCATGTACTATTTTCCAATTTTATTGCTAATTAATGGAAAAAATTTTGGAGAAATAGACTTGGGAATTGGAGTTAA 
AAACTTATTATTTGGAGACTGGGGAGGGCATTTAATGCAAAGCATAATTCACCTCATTTTAAATCAACACCGTCCA 
ATTCCAAGTATAAAAAGCTACGACAGCTACAATTATAGAGGATTTTT7VAGCTTTGCTCTAAATTACTCTTACATGA 
ATTTTTTAAATTTAGAAAATTATATGGACTTATCTTATTTTGCAGATTATTTTATTAAAAACAGTATTGGAATTAC 
CTTAAAAAATGAAAATATTGGATTTGATATAAAACTTTATTCCCAAATTCAAAATCAAATCAAAAGCCTCAAAACA 
TATTCAAAAACACAAGAAGCAGAAACAGGAATTGGAATAAATTATCAATTTTACTCTAAAAATTTTTTCATAACCA 
ATAATTTAAACATTAAAAATTTTTCAACCAAAGAAAATTTCTTAAGCGTTGGGGGATTTGGAATAATCATTACACC 
TGAAGAATACAAAAAAATATCAGAATCAAATAATGAATTTAATGTTATAAGTAATAATTTTTACTTTGGATTTGAT 
ATTATGATCCCATTAAAAATAAGAT^TTCATTATTTTATAAAATAAATGAAAACATCAACCATTACTTTTCAATAT 
CAACAAATTATTACACTAATTATAATGAAACTAATAGCTTTACAAATCAATTATCATCAGGCATCATGTATGAATT 
TTTACCACAAAAAACATTCAATCCTTACCTAATTTCGGGATTATTTTTTGCCTATAATCAAAACAATAAAGATATC 
AAAAGCATCTCAAGACCAATAAGAATAAAAAACATTCTTCAAGTTGGAATTGAAAATGAATTAGGATTTTTGTTCA 
AAATGCTAAAATACCGCAACACTGAGTATATTTTCAAAATATATTCAAAAGTTAACTATATTCCTATAGCTTATAA 
CTTAGATGAAAAAAAATTAGAAAAACATTCTATTAACTTTAATTATTTAGGAATTGGAATAGTCGTTAAATAA 

t850 .nt 

TATTCTTATAATTATGCAATCCAATATAAAAATGAAGGTATTGACAAATATTATTTTGAAATACTAAATG 

TCGGATTTTCATTAAGCGATTTTTTTGATGACTTGAGAAGTGGTTCTCTTATTTTTACCTATGTTTCAAAATACAA 

TTTTATAATAAATTTAGAAGCACACATGTTAACCTATAGGGGTTATAAAGACTCTCCGAAATCTTTAATTAGTAGA 

ACAGACTTAATTGAAATAGGCTTCATGTACTATTTTCCAATTTTATTGCTAATTAATGGAAAAAATTTTGGAGAAA 

TAGACTTGGGAATTGGAGTTAAAAACTTATTATTTGGAGACTGGGGAGGGCATTTAATGCAAAGCATAATTCACCT 

CATTTTAAATCAACACCGTCCAATTCCAAGTATAAAAAGCTACGACAGCTACAATTATAGAGGATTTTTAAGCTTT 

GCTCTAAATTACTCTTACATGAATTTTTTAAATTTAGAAAATTATATGGACTTATCTTATTTTGCAGATTATTTTA 

TTAAAAACAGTATTGGAATTACCTTAAAAAATGAAAATATTGGATTTGATATAAAACTTTATTCCCAAATTCAAAA 

TCAAATCAAAAGCCTCAAAACATATTCAAAAACACAAGAAGCAGAAACAGGAATTGGAATAAATTATCAATTTTAC 

TCTAAAAATTTTTTCATAACCAATAATTTAAACATTAAAAATTTTTCAACCAAAGAAAATTTCTTAAGCGTTGGGG 

GATTTGGAATAATCATTACACCTGAAGAATACAAAAAAATATCAGAATCAAATAATGAATTTAATGTTATAAGTAA 

TAATTTTTACTTTGGATTTGATATTATGATCCCATTAAAAATAAGAAATTCATTATTTTATAAAATAAATGAAAAC 

ATCAACCATTACTTTTCAATATCAACAAATTATTACACTAATTATAATGAAACTAATAGCTTTACAAATCAATTAT 

CATCAGGCATCATGTATGAATTTTTACCACAAAAAACATTCAATCCTTACCTAATTTCGGGATTATTTTTTGCCTA 

TAATCAAAACAATAAAGATATCAAAAGCATCTCAAGACCAATAAGAATAAAAAACATTCTTCAAGTTGGAATTGAA 

AATGAATTAGGATTTTTGTTCAAAATGCTAAAATACCGCAACACTGAGTATATTTTCAAAATATATTCAAAAGTTA 

ACTATATTCCTATAGCTTATAACTTAGATGAAAAAAAATTAGAAAAACATTCTATTAACTTTAATTATTTAGGAAT 

TGGAATAGTCGTTAAATAA 

f853.aa 

MKSFLFWILGTVGISSFAQNTPVAI INLYKNEI ITKTGFDSKVDIFKKTQGRDLTDAEKKQVLQVIilADVLFSQE 
ASKQGIKISDDEVMQTIRTQFGLVNFTDEQIKQMIEKQGTNWGELLSSMKRSLSSQKLVLKQAQPKFSEIKTPSEK 
EIVEYYEANKTKFWPDISRVSHIFFSTKDKKi^SDVLDQAKNILSQIRSKKITFEEAVRKYS^roESSKAKN 
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LSRGDQNAQNLLGADFVKEVFNFNKGDISSPIASKEGFHIVKVTEKYAQRFLGIJ^DKVSPTADLIVKDAIRNNMIN 
VQQQQ I WQ VQQDMYGKLNKS ANI Q I LD S S LK 

t853 .aa 

Q^^^PVAII^^JYKNEIITKTGFDSKVDIFKKTQGRDLTDAEKKQVLQVLIADVLFSQEASKQGIKISDDEVM 

QFGLVNFTDEQIKQMIEKQGTNWGELLSSMKRSLSSQKLVLKQAQPKFSEIKTPSEKEIVEYYEANKTKFVNPDIS 

RVSHIFFSTKDKKRSDVLIX^AKNILSQIRSKKITFEEAVRKYSNDESSKAKNGDLGFLSRGDQNAQNLLGADFVKE 

VFNFNKGDI SSPIASKEGFHI VKWEKYAQRFLGUSTDKVS PTADLIVKDAIRNT^ 

KSANIQILDSSLK 

f853.nt 

ATGAAGAGTTTTTTATTTTGGGTAATATTGGGAACTGTAGGGATTAGCTCTTTTGCTCAAAATACTCCTGTTGCTA 
TTATTAATTTATATAAGAATGAAATTATTACTAAAACTGGTTTTGATTCTAAGGTTGATATATTTAAAAAGACCCA 
AGGTAGAGACTTAACTGATGCTGAGAAAAAGCAAGTTCTGCAAGTTTTAATAGCAGATGTTCTTTTTAGTCAAGAG 
GCTTCAAAGCAAGGAATTAAAATCTCAGATGATGAGGTTATGCAAACAATTAGAACTCAATTTGGGCTTGTGAATT 
TTACTGATGAACAAATCAAGCAAATGATAGAAAAACAAGGTACAAATTGGGGCGAGCTTTTGTCTTCAATGAAAAG 
ATCTCTGTCTTCTCAAAAGCTTGTTTTAAAGCAAGCTCAGCCTAAGTTTTCTGAAATTAAAACTCCTAGTGAGAAA 
GAAATTGTTGAGTATTATGAGGCTAATAAAACTAAGTTTGTAAATCCCGATATTTCAAGAGTTAGTCATATCTTTT 
TTTCTACTAAAGATAAAAAAAGATCAGATGTTTTAGATCAAGCAAAAAATATTTTAAGCCAAATAAGATCAAAAAA 
AATTACTTTTGAAGAAGCTGTAAGAAAATATTCAAATGACGAATCTTCTAAGGCTAAAAATGGTGATCTTGGGTTT 
TTATCAAGAGGTGATCAAAATGCTCAAAATCTTCTTGGAGCCGATTTTGTGAAAGAGGTTTTTAATTTTAATAAGG 
GTGATATATCTTCGCCTATTGCTTCAAAGGAAGGGTTTCATATTGTTAAAGTTACAGAAAAATATGCTCAGAGATT 
TTTAGGTTTGAATGATAAAGTGTCTCCTACTGCAGATTTGATTGTCAAAGATGCAATAAGAAATAACATGATTAAT 
GTTCAACAACAGCAAATTGTTGTTCAAGTACAGCAAGATATGTATGGTAAGCTTAACAAGTCTGCAAATATACAAA 
TCTTGGATTCTAGTCTAAAATAA 

t853 .nt 

CAAAATACTCCTGTTGCTATTATTAATTTATATAAGAATGAAATTATTACTAAAACTGGTTTTGATTCTAAGGTTG 

ATATATTTAAAAAGACCCAAGGTAGAGACTTAACTGATGCTGAGAAAAAGCAAGTTCTGCAAGTTTTAATAGCAGA 

TCTTCrrTTTAGTCAAGAGGCTTCAAAGCAAGGAATTAAAATCTCAGATGATGAGGTTATGCAAACAATTAGAACT 

CAATTTGGGCTTGTGAATTTTACTGATGAACAAATCAAGCAAATGATAGAAAAACAAGGTACAAATTGGGGCGAGC 

TTTTGTCTTCAATGAAAAGATCTCTGTCTTCTCAAAAGCTTGTTTTAAAGCAAGCTCAGCCTAAGTTTTCTGAAAT 

TAAAACTCCTAGTGAGAAAGAAATTGTTGAGTATTATGAGGCTAATAAAACTAAGTTTGTAAATCCCGATATTTCA 

AGAGTTAGTCATATCTTTTTTTCTACTAAAGATAAAAAAAGATCAGATGTTTTAGATCAAGCAAAAAATATTTTAA 

GCCAAATAAGATCAAAAAAAATTACTTTTGAAGAAGCTGTAAGAAAATATTCAAATGACGAATCTTCTAAG 

AAATGGTGATCTTCGGTTTTTATCAAGAGGTGATCAAAATGCTCAAAATCTTCTTGGAGCCGATTTTGTGA^ 

GTTTTTAATTTTAATAAGGGTGATATATCTTCGCCTATTGCTTCAAAGGAAGGGTTTCATATTGTTAAAGTTA 

AAAAATATGCTCAGAGATTTTTAGGTTTGAATGATAAAGTGTCTCCTACTGCAGATTTGATTGTCAAAGATGCAAT 

AAGAAATAACATGATTAATGTTCAACAACAGCAAATTGTTGTTCAAGTACAGCAAGATATGTATGGTAAGCTTAAC 

AAGTCTGCAAATATACAAATCTTGGATTCTAGTCTAAAATAA 

f859.aa 

MKLPKLYKLILLFLFTTRLFSVKDEKSDNKLELFSNVETKIKKNSKNYDSNSNSKKIKKESILKRDTNSEKNINSN 

lYIQKSKKIbn^PNRNLGNNINQKTANDVNFTKTSYVKVYPNYKDDNFQEIKNANKFPAKTEKTHMLIGPIL 

IIIKMLKTKGYTLIEYIEDNN 

t859 .aa 

VKDEKSDNKLELFSNVETKIKKNSKNYDSNSNSKKIKKESILKRDTNSEKNINSNIYIQKSKK 
QKTA 



f859.nt 
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ATGAAATTACCAAAACTTTACAAATTAATACTACTCTTTCTTTTTACAACAAGATTGTTTTCAGTAAAAGATGAAA 
AATCAGACAATAAATTGGAATTATTTTCAAACGTAGAAACAAAAATCAAAAAAAATTCTAAAAATTACGACTCAAA 
TTCAAACAGCAAAAAGATCAAAAAAGAATCAATTTTAAAAAGAGATACAAACAGCGAAAAAAATATAAATTCCAAT 
ATATACATACAAAAATCAAAAAAAATTAATTACCCCAACAGAAATTTAGGCAATAATATCAATCAAAAAACTGCAA 
ATGATGTAAATTTTACAAAAACTAGTTATGTTAAAGTTTATCCCAACTATAAAGACGATAACTTTCAAGAAATTAA 
AAATGCTAATAAATTTCCAGCTAAAACCGAAAAAACTCACATGCTAATCGGCCCAATATTAAAAGATAATCTAGGA 
ATAATAATTAAAATGCTAAAAACAAAGGGATACACTTTAATAGAATACATAGAGGACAATAATTAA 

t859 .nt 

GTAAAAGATGAAAAATCAGACAATAAATTGGAATTATTTTCAAACGTAGAAACAAAAATCAAAAAAAATTCTAAAA 
ATTACGACTCAAATTCAAACAGCAAAAAGATCAAAAAAGAATCAATTTTAAAAAGAGATACAAACAGCGAAAAAAA 
TATAAATTCCAATATATACATACAAAAATCAAAAAAAATTAATTACCCCAACAGAAATTTAGGCT^TAATATCAAT 
CAAAAAACTGCAAATGATGTAAATTTTACAAAAACTAGTTATGTTAAAGTTTATCCCAACTATAAAGACGATAACT 
TTCAAGAAATTAAAAATGCTAATAAATTTCCAGCTAAAACCGAAAAAACTCACATGCTAATCGGCCCAATATTAAA 
AGATAATCTAGGAATAATAATTAAAATGCTAAAAACAAAGGGATACACTTTAATAGAATACATAGAGGACAATAAT 
TAA 

fSei.aa 

MKNFKEVIIIFDSGIGGLSYFKYIKSRIGGCQYVYVADNKNFPYGEKSPEYLLEAVLFLIEKLKKIYNIGALVLAC 
NTISVSVYNKLNFVFPVVYTLPDVSSVSDLVLKJ^VLLIATNTTLESKFVKDQVNIHNDLIVKAAGELVNFVEYGEN 
YKKYALRCLEALKFEVVNTGREIVFLGCTHYLHLKVMIEDFLKIPWENRELVVKNLIRSiyn^ 
FVDDEFYLTENKNLTFYQNFCKKYNLRFKGMIV 

t861,aa 

RIGGCQYVYVADNKNFPYGEKSPEYLLEAVLFLIEKLKKIYNIGALVI^CNTISVSVYNKLNFVFPVVYTLPDVSS 

VSDLVLKRVLLIATNTTLESKFVKDQVNIHNDLIVKAAGELVNFVEYGENYKKYALRCLEALKFEWNTGREIVFL 

GCTHYLHLKVMIEDFLKIPVYENRELVVTCNLIRSl^FSEHKGNYYKNDFDFVDDEFYLTENK^ 

RFKGMIV 

f861.nt 

ATGAAAAATTTCAAAGAAGTAATAATTATTTTTGATTCAGGAATAGGAGGGCTTTCTTATTTTAAATATATTAAAA 

GTAGAATAGGGGGATGCCAATATGTTTATGTTGCCGATAATAAAAATTTCCCTTATGGAGAAAAAAGTCCTGAATA 

TCTTCTAGAAGCAGTTTTGTTTTTGATTGAGAAGCTTAAAAAAATCTATAATATTGGTGCATTAGTTTTGGCTTC 

AATACAATTTCTGTTAGTGTATACAATAAATTAAATTTTGTTTTTCCAGTAGTCTATACTTTGCCAGATGTAAGTT 

CAGTTTCAGATCTTGTTTTAAAAAGAGTTCTTTTGATTGCAACAAATACTACTCTTGAAAGCAAATTTGTTAAGG 

TCAAGTAAATATACATAATGATTTGATTGTAAAAGCTGCTGGAGAGCTTGTTAATTTTGTTGAATATGG^^ 

TACAAAAAATATGCTCTTAGATGTTTAGAAGCTTTAAAATTTGAAGTTGTAAATACTGGTAGAGAAATTGTTTTTC 

TTGGATGCACGCATTATTTGCATCTTAAGGTAATGATAGAAGATTTTTTAAAAATTCCTGTTTATGAGAATC^ 

ATTAGTGGTAAAAAATCTTATTAGATCAATGAATTTTTCTGAACACAAAGGTAATTATTATAAGAATGATTTTGAT 

TTTGTAGATGATGAGTTTTATTTGACCGAAAATAAAAATTTGACTTTTTATCAAAATTTTTGCAAAAAAT^ 

TTCGCTTTAAGGGAATGATAGTTTGA 

tsei.nt 

agaatagggggatgccaatatgtttatgttgccgataataaaaatttcccttatggagaaaaaagtcctgaatatc 

ttctagt^gcagttttgtttttgattgagaagcttaaaaaaatctataatattggtgcattagttttggcttgtaa 

tacaatttctgttagtgtatacaataaattat^ttttgtttttccagtagtctatactttgccagatgtaagttca 

gtttcagatcttgttttaaaaagagttcttttgattgcaacaaatactactcttgaaagcaaatttc 

aagtaaatatacataatgatttgattgtaaaagctgctggagagcttgttaattttgttgaatatggagag;^ 

CAAAAAATATGCTCTTAGATGTTTAGAAGCTTTAAAATTTGAAGTTGTAAATACTG^ 

ggatgcacgcattatttgcatcttaaggtaatgatagaagattttttaaaaattcctgt^ 
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TAGTGGTAAAAAATCTTATTAGATCAATGAATTTTTCTGAACACAAAGGTAATTATTATAAGAATGATTT^ 

TGTAGATGATGAGTTTTATTTGACCGAAAATAAAAATTTGACTTTTTATCAAAATTTTTGCAAAAAATATAATCTT 

CGCTTTAAGGGAATGATAGTTTGA 

£363 .aa 

MIRLKVLILCLFGIFVLNGFADTNFEFNFGGGVAFPVSPFSSFYNEALEINAKLKQNLPSDLSPIEKEEIVQNFSD 

LANIAKAGIRYGTYAQFGAKFDDFVSIGFELLFNINLLKAIKRSDGTANENFSFIMAITPRFYTKLDFFVI^^ 

TGPKINIATSSADSVIAELGTMGWDIGARLSFSFLILEGYYVWNIKNPKFSDFKFGIGFEFGIV 

t363 .aa 

DTNFEFNFGGGVAFPVSPFSSFYNEALEINAKLKQNLPSDLSPIEKEETVQNFSDLANIAKAGIRYGTYAQFGAKF 
DDFVSIGFELLFNINLLKAIKRSDGTANENFSFIMAITPRFYTKLDFFVLALAFFTGPKINIATSSADSVLAELGT 
MGWDIGARLSFSFLILEGYYVWNIKNPKFSDFKFGIGFEFGIV 

f363.nt 

ATGATTAGGCTTAAAGTTTTAATTTTGTGTTTATTTGGGATTTTTGTGTTAAATGGTTTTGCAGATACTAATTTTG 

AATTCAATTTTGGTGGTGGGGTTGCTTTTCCTGTTAGTCCCTTTTCAAGCTTTTACAATGAGGCTTTAGAGATTAA 

TGCAAAGCTTAAGCAAAATTTGCCTTCAGATTTATCCCCAATAGAAAAAGAAGAGATAGTCCAAAATTTTTCCGAT 

TTAGCCAATATTGCTAAAGCTGGAATAAGATATGGAACTTACGCTCAATTTGGCGCTAAATTTGATGATTTTGTTT 

CTATTGGATTTGAGCTTTTGTTTAACATTAATCTTCTTAAAGCAATAAAGCGTTCGGATGGAACTGCAAATGAAAA 

TTTCTCGTTTATTATGGCAATAACACCAAGATTTTATACAAAATTAGATTTTTTTGTTTTAGCTTTAGCGTTTTTC 

ACAGGTCCTAAGATCAATATAGCGACTTCTTCTGCGGATTCTGTTTTAGCAGAACTGGGAACAATGGGCTGGGATA 

TTGGTGCTAGACTTTCATTTTCTTTTTTAATTCTTGAAGGGTACTATGTTTGGAATATTAAAAACCCTAAATTTTC 

TGATTTCAAGTTTGGAATAGGTTTTGAATTTG 

GAATTGTGTAG 

t363 .nt 

GATACTAATTTTGAATTCAATTTTGGTGGTGGGGTTGCTTTTCCTGTTAGTCCCTTTTCAAGCTTTTACAATGAGG 
CTTTAGAGATTAATGCAAAGCTTAAGCAAAATTTGCCTTCAGATTTATCCCCAATAGAAAAAGAAGAGATAGTCCA 
AAATTTTTCCGATTTAGCCAATATTGCTAAAGCTGGAATAAGATATGGAACTTACGCTCAATTTGGCGCTAAATTT 
GATGATTTTGTTTCTATTGGATTTGAGCTTTTGTTTAACATTAATCTTCTTAAAGCAATAAAGCGTTCGGATGGAA 
CTGCAAATGAAAATTTCTCGTTTATTATGGCAATAACACCAAGATTTTATACAAAATTAGATTTTTTTGTTTTAGC 
TTTAGCGTTTTTCACAGGTCCTAAGATCAATATAGCGACTTCTTCTGCGGATTCTGTTTTAGCAGAACTGGGAACA 
ATGGGCTGGGATATTGGTGCTAGACTTTCATTTTCTTTTTTAATTCTTGAAGGGTACTATGTTTGGAATATTAAAA 
ACCCTAAATTTTCTGATTTCAAGTTTGGAATAGGTTTTGAATTTGGAATTGTGTAG 

f368.aa 

MIDLTQEKQEILIKNKFLAKVFGLMSIGLLISAVFAYATSENQTIKAIIFSNSMSFMAMILIQFGLVYAIS 
ISSNTATALFLLYSALTGVTLSSIFMIYTQGS IVFTFGITAGTFLGMSVYGYTTTTDLTKMGSYLIMGLWGI 1 1 AS 
LVNMFFRSSGLNFLISILGWIFTGLTAYDVQNISKJmKMLQDDTEIKNRMAWASLKLYLDFINLFLYLL^ 
RRND 

t368 .aa 

TSENQTIKAIIFSNSMSFMAMILIQFGLVYAISGALNKISSNTATALFLLYSALTGVTLSSIFMIYTQGSIVFTFG 
ITAGTFLGMSWGYTTTTDLTKMGSYLIMGLWGIIIASLVNMFFRSSGLNFLISIL/SWIFTGLTAYDVQNISK^ 
KMLQDDTEIKNRMAWASLKLYLDFINLFLYLLRFLGQRRND 



f368.nt 
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ATGATCGATTTAACACAAGAAAAACAAGAAATACTAATAAAAAACAAGTTTTTAGCCAAAGTT^ 

CAATTGGACTTTTAATCTCAGCAGTATTTGCATATGCAACCTCAGAAAATCAAACAATCAAAGCAATAATATTCTC 

AAATTCAATGTCATTTATGGCTATGATACTTATACAATTTGGACTTGTATATGCAATAAGT 

ATATCAAGCAATACTGCAACAGCTCTTTTCTTGCTCTACTCAGCACTAACAGGAGTAACATTATCTTCTATATTTA 

TGATTTACACACAAGGATCAATAGTATTCACATTCGGAATTACTGCTGGAACATTTCTTGGAATGTCTGTTTATC 

ATACACTACAACAACAGATCTAACAAAAATGGGAAGCTATTTAATAATGGGCTTATGGGGAATCATTATTGCATCT 

CTTGTTAATATGTTTTTTAGAAGCTCAGGTCTTAATTTCCTTATATCTATTTTGGGCGTAGTTATATTTACAC^ 

TAACAGCTTATGATGTTCAAAATATTTCTAAAATGGACAAAATGCTACAAGACGACACTGAAATAAAAAACAGAAT 

GGCGGTTGTAGCCTCACTTAAACTTTATTTAGATTTTATAAATTTATTCTTATATCTTCTAAGATTTTTGGGCCi^ 

AGAAGAAACGATTAA 

t368.nt 

ACCTCAGAAAATCAAACAATCAAAGCAATAATATTCTCAAATTCAATGTCATTTATGGCTATGATACTTATACAAT 

TTGGACTTGTATATGCAATAAGTGGTGCTCTTAATAAAATATCAAGCAATACTGCAACAGCTCTTTTCTTGCTCTA 

CTCAGCACTAACAGGAGTAACATTATCTTCTATATTTATGATTTACACACAAGGATCAATAGTATTCACATTCGGA 

ATTACTGCTGGAACATTTCTTGGAATGTCTGTTTATGGATACACTACAACAACAGATCTAACAAAAATGGGAAGCT 

ATTTAATAATGGGCTTATGGGGAATCATTATTGCATCTCTTGTTAATATGTTTTTTAGAAGCTCAGGTCTTAATTT 

CCTTATATCTATTTTGGGCGTAGTTATATTTACAGGCTTAACAGCTTATGATGTTCAAAATATTTCTAAAATGGAC 

T^AAATGCTACAAGACGACACTGAAATAAAAAACAGAATGGCGGTTGTAGCCTCACTTAAACTTTATTTAGATTOT 

TAAATTTATTCTTATATCTTCTAAGATTTTTGGGCCAAAGAAGAAACGATTAA 

f371.aa 

MKFFFLLQIALILLSNSSLLFGQSPPKEKEDSLLLYKEGKFKEAILNTLEEIRLNPSNLDARTILIWSLIAIGEYK 
RAEKEAIIGLGIKKHDIRIIQALGEAYFFQKNYDNALKYFQEYISLDSKGARIIKVYNLIADSFYELKRYNEADFA 
YEHALRFSPNNQNLLIKLARSRINAKNKILAEEALIKILTISPNNLEAKNLLEELKKSNNKP 

t371.aa 

EDSLLLYKEGKFKEAILNTLEEIRLNPSNLDARTILIWSLIAIGEYKRAEKEAIIGLGIKKHDIRIIQALGEAYFF 

QKNYDNALKYFQEYISLDSKGARIIKVYNLIADSFYELKRYNEADFAYEHALRFSPNNQNLLIKIJaiSRIN^^ 

LAEEALIKILTISPNNLEAKNLLEELKKSNNKP 

f371.nt 

ATGAAATTTTTTTTTCTATTACAAATAGCTTTAATTCTACTATCCAATTCAAGCTTGTTATTTGGACAATCACCGC 

CTAAAGAAAAAGAAGACTCTCTTCTTCTATATAAAGAAGGT^AAATTTAAAGAAGCTATTTTAAA 

AATTCGACTAAATCCTAGTAACTTAGATGCTAGGACAATATTGATATGGAGCTTAATAGCCATAGGAGAATACAAG 

AGAGCTGAAAAAGAGGCGATTATAGGACTTGGCATTAAAAAACATGACATAAGAATTATTCAAGCACTAGGAGA^ 

CTTATTTCTTTCAAAAAAATTATGACAATGCATTAAAATACTTTCAAGAATACATTAGCCTTGATTCTAAAGGAGC 

AAGAATAATAAAAGTTTATAATTTAATTGCAGATTCTTTTTATGAGCTAAAAAGATATAATGAAGCCGATTTTGCA 

TACGAACATGCATTACGTTTTTCTCCTAATAACCAAAATCTATTAATAAAATTAGCAAGATCAAGAATAAATGCAA 

AAAATAAAATATTAGCAGAAGAAGCACTAATTAAAATTCTTACAATCTCTCCTAATAATCTAGAGGCAAAAAATTT 

ACTAGAAGAATTAAAAAAAAGCAACAACAAACCTTGA 

t371.nt 

gaagactctcttcttctatataaagaaggaaaatttaaagaagctattttaaacacgttagaagaaattcgactaa 

atcctagtaacttagatgctaggacaatattgatatggagcttaatagccataggagaatacaagagagctgaaaa 

agaggcgattataggacttggcattaaaaaacatgacataagaattattcaagcactaggagaagcttatttcot 

caaaaaaattatgacaatgcattaaaatactttcaagaatacattagccttgattctaaaggagcaagaataa 

aagtttataatttaattgcagattctttttatgagctaaaaagatataatgaagccgattttgcatacgaacatg 

attacgtttttctcctaataaccaaaatctattaataaaattagcaagatcaagaataaatgcaaaa;^ 

ttagcagaagaagcactaattaaaattcttacaatctctcctaataatctagaggcaaaaaatttactag^ 

taaaaaaaagcaacaacaaaccttga 
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f502 .aa 

MKKANFLSTNFLILLLVCFVNVNLFSKXJIFKFKLVDQFFPFYYKNNKGEY^^ 

LNESEIEDEAIYLGLTYNVKIJSrDFFYFKSEIJUlSISILFFKNSNKKYKNTHSTFLSNFN^ 

VNTIFLADNSQELVLALKNDKVDYIYGDCKTLHYIANISFFL 

FSYLMKMPEELVFSFLDSNAKGSFVDVGLYNDYPPLSFINSQGKLSGILVDLWNLLSRQHIFKPIFKGFSKEDIKK 
SLDGKSVGIFGGIISNDSVLENVNYWSKPIYPLNFKFYSKDLSNDAGPINSQFIDFNFNNIQLNK^ 
IVNNSYGFIENSITTKYLLKIiNGYNGRLKSYDSIFNKNRFLVLAIDNRIYKVIKYILNSIFDD^ 
LDKEEINSSRINSYKIMNKVKFNIEEKIWLSKNNKLNLAVKNWYPIDYVEANNYKGIN^ 

KVHSSLDLKKLIKSGKIDMLNTNATDSNLDNVFNIKLNSRIPLYIFSNKKRVLPSRSLEKFAILDFLYSKNLASNI 

KSKLILVSSFNEALLLLYKGKVDGIISDEYTAAAVFEELNIDDVEKIPTFRDLAFDLSLAIYNQDYILKEIIQKW 

MRSNVDSQMYLNDWKFDIYYKSRSIRFKNFKFLVITFIIFYFTFLGFVIIFMFRLSFEQKRRYSFV^^ 

NAAKTIFIAWSHDIRTPINGIMAATELLDTTILTDVQKDYVRMIlSnfSSDSLLSLIDDILYLSKIDVNELY^ 

IDLESEMEMVLKAFQSQCAKKNIDLFSYSKSIFNNYIKGDIVKIKQVLINLIGNAFKFTDDGVIVLNYEEVCRTRT 

IXSNRVLVTVEFKVIiyrGKGIEKENFSKIFEIFKQEDDSSSRVHEGAGLGLSISRELIRLMGGLGIAVDSKVGEGTT 

FSFMLPFLLGSELKSKKLSINRFQSWGDNKVLNVLLSQKSIKIFEHCSILLGCSSNVRYVASFEDAYKVFKKYPS 

YNFVYINVNNDNIQEGIRLANNIERUSrSDVQIIFLFYYLDNKALKNLKYGYV^ 

EDI^^JDSALRIKEPINVLIAEDNQVNQKVLKDILWIGINENFIDVVDIX^VKALKSLKDKKYTISFIDIRMPRYD 

GFSVAKEIRKFEKAKNLKPCVLVAVTAHALQEYKDKCLASGMNDYISKPIHISSIKTILKKYLQFEVDDIGENENL 

NQLVKFPNLDVNRALKELNLSYVSYSELCRGLVDFISINIIDLEKAFDEEDLSLIKDISHSISGALSNMRSELYKD 

FQKIETSKDSISELKKiyn^SFVKDDLFQLISDIKENILFESEIVSENKLYFKNlSroQFIJ^LLNKLL 

EILESINKYVLDDNIQVLFSDLRRNLRLYRFAESSKILEEIIEMLNNKRY 

t502 .aa 

cfviwnlfskdifkfklvdqffpfyyknnkgeyeglifsildkwakdi^adimvehidnlne 

nwlndffyfkselarsisilffknsnkkyknthstflsnfnigvikntiyedilrlknvntifl;^ 

kl^k^yiygdcktlhyiannflsedlviftgdvfysiknrvaisrnapeivknl^ 

snakgsfvdvglyndypplsfinsqgklsgilvdlwnllsrqhifkpifkgfskedikksldgksvgifggiisnd 
svlelwnywskpiyplnfkfyskdlsndagpinsqfidfnfmiqlnki^ikdivnnfidiv^ 
llklngyngrlksydsifnknrflvlaidnriykvikyilnsifddisfesllqidknwldkeeinssrinsykim 
nkvkfnieekiwlsknnki^lavkiawypidyvean^ 

dmijsrtnatdsnldnvfniklnsriplyifsnkkrviipsrslekfaildflysknlasniksklilvssfneallll 

YKGKVIX3I I SDEYTAAAWEELNIDDVEKI PTFRDIJ^FDLSI^I YNQDYI LKEI IQKVVMRSNTO 
lYYKSRSIRFKNFKFLVITFIIFYFTFLGFVIIFMFRLSFEQKRRYSFVMNEKKIAEAANAAKTIFIANVSHDIRT 
PINGIMAATELLDTTILTDVQKDYVRMINYSSDSLLSLIDDILYLSKIDVNELYVESQEIDLESEMEMV^ 
CAKKNI DLFS YSKS I FNlT!f IKGDI VKI KQVL INLIGNAFKFTDDGVI VLNYEEVCRTRTDGN^ 

KGIEKENFSKIFEIFKQEDDSSSRVHEGAGLGLSISRELIRLMGGLGIAVDSKVGEGTTFSFMLPFLLGSELKSKK 

LSIimFQSVNGDNKVLNVLLSQKSIKIFEHCSILIXX:SSlWRYVASFEDAYK^ 

RIJ^NNI ERLNSDVQI I FLFYYLDNKALKNLKYGYVKKPIiMGIXSI CS ILYKKE 

VLIAEDNQVNQKVLKDILWIGINENFIDVVDDGVKALKSLKX>KKYTISFIDIRMPRYIX?F 

KPCVLVAWAHALQEYKDKCIiASGMNDYISKPIHISSIKTILKKYLQFEVDDIGENENIi^ 

LNLSYVSYSELCRGLVDFISINIIDLEKAFDEEDLSLIKDISHSISGALSNMRSELYKDFQKIETSKDSISELKKM 
YSFVKDDLFQLISDIKENILFESEIVSENKLYFKNNDQFLNLLNKLLIGIKTRKPREYKEILESINKYVLDDNIQV 
LFSDLRRNLRLYRFAESSKILEEIIEMIiNNKRY 

£502. nt 

atgaaaaaagcaaactttttaagtactaattttttaattttacttttggtttgctttgtci^ 

ctaaggatattttcaagtttaagcttgtagatcaattttttcctttttactacaagaataataaaggagaatatga 

aggacttattttttctattttagataaatgggcaaaagataataatgctgatattatggttgagcatattgataat 

ttaaatgaaagtgaaattgaagacgaagcaatatatttaggattaacttataatgtat^aa 

attttaaaagtgagcttgctaggagtatttcaattttattttttaaaaactctaataa;^^ 

ttcaacatttttatccaattttaatataggagttattaaaaatacaatatatgaagatatctta^ 

gttaacaccatttttttggctgataattctcaagagttagtattggcctt;^^ 
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ATGGTGATTGCAAGACTTTACATTATATTGCAAATAACTTTTTAAGTGAAGATCTTGTGATTTTTACCGGGGATGT 

TTTTTATAGTATCAAAAATAGAGTGGCTATTAGTAGAAATGCTCCTGAGATAGTAAAGAATTTGAATTTAGATTTG 

TTTTCATATTTAATGAAAATGCCTGAGGAACTTGTTTTTTCTTTTTTAGATAGCAATGCTAAGGGAAGTTTTGTTC 

ATGTTGGTTTATATAATGATTATCCTCCTTTAAGTTTTATTAATTCACAGGGAAAATTGTCTGGCATTTTAGTC^ 

TTTGTGGAATCTTCTCTCAAGACAACATATCTTTAAACCTATTTTTAAGGGATTTTCCAAAGAGGATATTAAGAAA 

TCATTAGATGGAAAATCAGTAGGTATTTTTGGAGGAATTATTAGCAATGATAGTGTGTTGGAAAATGTTAATTATC 

TAGTAAGTAAGCCAATATATCCTCTTAATTTTAAATTTTATTCTAAAGACCTAAGCAATGATGCTGGTCCAATAAA 

TTCTCAGTTTATTGATTTTAATTTTAATAATATTCAATTAAATAAGAATAAAGATATTGTTAATAACTTTATAGAT 

ATTGTTAATAATTCATATGGGTTTATAGAAAATTCAATAACAACAAAATATTTGTTAAAATTAAAT^ 

GTAGATTAAAATCTTACGATTCGATTTTTAATAAAAATAGGTTTTTAGTATTAGCCATTGATAATAGGATTTATAA 

GGTTATTAAATATATTCTCAATTCTATATTTGATGATATTTCATTTGAATCTTTGCTTCAAATAGATAAAAATTGG 

TTGGATAAAGAAGAGATTAATAGTTCTAGAATAAATAGTTATAAAATTATGAATAAGGTTAAATTTAATATAGAAG 

AAAAAATTTGGTTATCAAAAAATAATAAATTAAATCTTGCTGTTAAAAATTGGTATCCAATAGATTATGTTGAGGC 

AAATAATTATAAAGGAATAAATCAATTTTTGCTTGATAAGATTAGAATGTTTTCAGGTTTGAGATTTAACATAATT 

AAAGTACACAGCAGTTTAGATCTTAAAAAATTAATCAAATCTGGAAAAATCGATATGCTAAATACTAATGCAACCG 

ATTCAAATTTAGATAATGTTTTCAACATAAAATTAAATTCTCGAATTCCACTTTATATTTTTTCAAATAAGAAAAG 

GGTGCTTCCATCTAGATCTTTAGAAAAGTTTGCTATACTTGATTTTTTATATAGTAAAAATTTGGCTTCTAATATT 

AAATCAAAGCTTATTCTGGTAAGCAGTTTTAATGAAGCGTTGCTTCTTCTTTATAAGGGAAAGGTAGATGGGATTA 

TTAGCGATGAGTATACAGCTGCTGCTGTTTTTGAGGAATTAAATATTGATGATGTTGAAAAAATTCCTACTTTTAG 

AGATTTGGCTTTTGATTTGAGTCTTGCTATTTATAATCAAGATTATATCTTGAAAGAAATTATTCAAAAAGTTGTT 

ATGCGTTCAAATGTTGACAGTCAGATGTATTTAAATGATTGGAAATTTGATATTTATTATAAATCCAGAAGTATCA 

GGTTTAAAAATTTCAAATTTTTAGTGATAACATTCATTATATTTTATTTTACTTTTTTAGGATTTGTAATTATATT 

tatgttcagattatcatttgagcagaaaagaagatattcttttgtgatgaatgaaaaaaagattgcggaagccgct 

aatgctgctaaaaccatttttatagccaatgtcagtcatgatattcgtacccctattaacggaataatggcggcta 

ctgagcttttggatacaactattcttacagatgttcaaaaagattatgttaggatgataaattattcatctgattc 

tttgctttctttaattgatgatatattgtatttgtctaaaatagatgtcaatgaattatatgttgagagtcaagag 

attgatttagagagtgaaatggaaatggttttaaaagcttttcaatctcaatgtgcaaagaaaaatattgatttat 

tctcttattctat^tctatttttaataattatataaagggtgatattgtaaaaattaaacaagttttaattaattt 

aataggaaatgcttttaagtttacagatgatggtgttattgttttaaattatgaagaagtatgtagaacaagaact 

gatggtaatagggttttggttacagttgaatttaaggtaatagatacaggcaaagggattgaaaaagaaaattttt 

ctaagatatttgaaatatttaaacaagaggatgattcttcttcaagggttcatgaaggtgcaggattgggattgtc 

aatatctagagagcttataagactaatgggtggtcttggtattgctgttgatagcaaggtgggagagggtacaact 

ttttcatttatgttgccctttttattgggtagtgagcttaaaagtaaaaaattgtcaatcaatagatttcaatcag 

taaatggtgacaataaagtattaaatgtgcttttaagtcaaaaatctattaaaatttttgagcactgttcgatttt 

attgggatgctcttctaatgtgcgctatgtagcgtcttttgaggatgcttataaagtcttcaagaaatacccttct 

tataattttgtttatataaatgtaaataacgataatattcaagagggtattcgacttgccaataatattgaaagac 

taaattctgatgtacaaattatttttttattttattatttagataataaagctctaaaaaatttaaaatatggot 

tgttaaaaagcctttaatggggcttggtatatgctctattctttataaaaaagagtttaacccagaaatggatttt 

gaggatttggttccaatagatagtgctttaaggataaaagagcccattaatgttttaatagctgaaga 

taaatcaaaaagtgttgaaagatattcttgttgttataggcattaatgaaaattttattgatgttgtagatgatc^ 

agtaaaggctttaaaatctttaaaagataaaaaatatactatctcttttattgatatacgaatgccaagatatgat 

ggattttcggtggctaaggaaattagaaaatttgaaaaggcaaagaatttaaagccttgtgttttggttgctgta^ 

cagcgcatgctttgcaagagtataaagacaagtgtcttgcaagtggtatgaatgattatatctcaaaaccaataca 

cataagttcaattaaaactatattaaaaaaatacttacagtttgaagttgatgatattggggagaatgaaaatttc 

aatcaacttgttaagtttcctaatttagatgttaatagggctttaaaagaattaaatctttcatatgtatcatatt 

ctgaattatgtagagggcttgttgattttatctctattaatattattgatttggaaaaagcttttgatgaggaaga 

tttgtctttaattaaggatatatctcattcaatatctggagctctttctaatatgcgtagcgaattgtataaagat 

tttcaaaaaattgaaacaagtaaagattcaatttctgagttgaaaaaaatgtattcttttgtaaaagatgat^ 

ttcaactaataagcgacataaaggaaaatattttgtttgagtctgagattgttagtgagaacaagctatattttaa 

aaataatgatcaatttttaaaccttctcaacaaacttttaattggtattaagactagaaagccaagag^ 

gaaattcttgagagcattaataaatatgttttagacgataatattcaggtattatttagtgatcttcgcag;^ 

taagattatatagatttgctgagagctctaagattcttgaagagattattgaaatgcttaataataagagatatta 

G 



t502.nt 
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TGCTTTGTCAACGTCAATTTATTTTCTAAGGATATTTTCAAGTTTAAGCTTGTAGATCAATTT^ 

ACAAGAATAATAAAGGAGAATATGAAGGACTTATTTTTTCTATTTTAGATAAATGGGCAAAAGATAATAATGCTGA 

TATTATGGTTGAGCATATTGATAATTTAAATGAAAGTGAAATTGAAGACGAAGCAATATATTTAGGATTAACTTAT 

AATGTAAAATTAAATGATTTTTTTTATTTTAAAAGTGAGCTTGCTAGGAGTATTTCAATTTTAT^ 

CTAATAAAAAATATAAAAATACCCATTCAACATTTTTATCCAATTTTAATATAGGAGTTATTAAAAATACAATATA 

TGAAGATATCTTAAGGTTAAAAAACGTTAACACCATTTTTTTGGCTGATAATTCTCAAGAGTTAGTATTGGCCTTA 

AAAAACGATTU^GTTGATTATATATATGGTGATTGCAAGACTTTACATTATATTGCAAATAACTTTTTAAGTGAAG 

ATCTTGTGATTTTTACCGGGGATGTTTTTTATAGTATCAAAAATAGAGTGGCTATTAGTAGAAATGCTCCTGAGAT 

AGTAAAGAATTTGAATTTAGATTTGTTTTCATATTTAATGAAAATGCCTGAGGAACTTGTTTTTTCTTTTTTAGAT 

agcaatgctaagggaagttttgttgatgttggtttatataatgattatcctcctttaagttttattaattcacagg 
gaaaattgtctggcattttagtggatttgtggaatcttctctcaagacaacatatctttaaacctatttttaaggg 
attttccaaagaggatattaagaaatcattagatggaaaatcagtaggtatttttggaggaattattagcaatgat 
agtgtgttggaaaatgttaattatgtagtaagtaagccaatatatcctcttaattttaaattttattctaaagacc 
taagcaatgatgctggtccaataaattctcagtttattgattttaattttaataatattcaattaaataagaataa 
agatattgttaataactttatagatattgttaataattcatatgggtttatagaaaattcaataacaacaaaatat 
ttgttaaaattaaatggatataacggtagattaaaatcttacgattcgatttttaataaaaataggtttttagtat 
tagccattgataataggatttataaggttattaaatatattctcaattctatatttgatgatatttcatttgaatc 
tttgcttcaaatagataaaaattggttggataaagaagagattaatagttctagaataaatagttataaaattatg 
aataaggttaaatttaatatagaagaaaaaatttggttatcaaaaaataataaattaaatcttgctgttaaaaatt 
ggtatccaatagattatgttgaggcaaataattataaaggaataaatcaatttttgcttgataagattagaatgtt 
ttcaggtttgagatttaacataattaaagtacacagcagtttagatcttaaaaaattaatcaaatctggaaaaatc 
gatatgctaaatactaatgcaaccgattcaaatttagataatgttttcaacataaaattaaattctcgaattccac 
tttatattttttcaaataagaaaagggtgcttccatctagatctttagaaaagtttgctatacttgattttttata 
tagtaaaaatttggcttctaatattaaatcaaagcttattctggtaagcagttttaatgaagcgttgcttcttcto 
tataagggaaaggtagatgggattattagcgatgagtatacagctgctgctgtttttgaggaattaaatattgatg 
atgttgaaaaaattcctacttttagagatttggcttttgatttgagtcttgctatttataatcaagattatatctt 
gaaagaaattattcaaaaagttgttatgcgttcaaatgttgacagtcagatgtatttaaatgattggaaatttgat 
atttattataaatccagaagtatcaggtttaaaaatttcaaatttttagtgataacattcattatattttatttta 
cttttttaggatttgtaattatatttatgttcagattatcatttgagcagaaaagaagatattcttttgtgatgaa 
tgaaaaaaagattgcggaagccgctaatgctgctaaaaccatttttatagccaatgtcagtcatgatattcgtacc 
cctattaacggaataatggcggctactgagcttttggatacaactattcttacagatgttcaaaaagattatgtta 
ggatgataaattattcatctgattctttgctttctttaattgatgatatattgtatttgtctaaaatagatgtcaa 
tgaattatatgttgagagtcaagagattgatttagagagtgaaatggaaatggttttaaaagcttttcaatctcaa 
tgtgcaaagaaaaatattgatttattctcttattctaaatctatttttaataattatataaagggtgatattgt;^ 
aaattaaacaagttttaattaatttaataggaaatgcttttaagtttacagatgatggtgttattgttttaaatta 

TGAAGAAGTATGTAGAACAAGAACTGATGGTAATAGGGTTTTGGTTACAGTTGAATTTAAGGTAATAGATAC^ 

aaagggattgaaaaagaaaatttttctaagatatttgaaatatttaaacaagaggatgattcttcttcaagggttc 

atgaaggtgcaggattgggattgtcaatatctagagagcttataagactaatgggtggtcttggtattgctgttga 

tagcaaggtgggagagggtacaactttttcatttatgttgccctttttattgggtagtgagcttaaaagta 

ttgtcaatcaatagatttcaatcagtaaatggtcacaataaagtattaaatgtgcttttaagtcaaaaatctatta 

aaatttttgagcactgttcgattttattgggatgctcttctaatgtgcgctatgtagcgtcttttga 

taaagtcttcaagaaatacccttcttataattttgtttatataaatgttvaataacgataatattcaagagggtatt 

cgacttgccaataatattgaaagactaaattctgatgtacaaattatttttttattttattatttagataata;^ 

ctctaaaaaatttaaaatatggttatgttaaaaagcctttaatggggcttggtatatgctctattctttataaaaa 

agagtttaacccagaaatggattttgaggatttggttccaatagatagtgctttaaggataaaagagcccattaat 

gttttaatagctgaagataatcaggtaaatcaaaaagtgttgaaagatattcttgttgttataggcattaatc 

attttattgatgttgtagatgatggagtaaaggctttaaaatctttaaaagataaaaaatatactatctc 

tgatatacgaatgccaagatatgatggattttcggtggctaaggaaattagaaaatttg/vaaaggcaaag^ 

aagccttgtgttttggttgctgtaacagcgcatgctttgcaagagtataaagacaagtgtcttgcaagtggt^ 

atgattatatctcaaaaccaatacacataagttcaattaaaactatattaaaaaaatacttacagtttgaagttga 

tgatattggggagaatgaaaatttgaatcaacttgttaagtttcctaatttagatg 

ttaaatctttcatatgtatcatattctgaattatgtagagggcttgttgattttatctctattaatattat^ 

tggaaaaagcttttgatgaggaagatttgtctttaattaaggatatatctcattcaatatctggagctctt^ 

tatgcgtagcgaattgtataaagattttcaaaaaattgaaacaagtaaagattcaatttctgagttgaaa;^^ 

tattcttttgtaaaagatgatttatttcaactaataagcgacataaaggaaaatattt^ 

ttagtgagaacaagctatattttaaaaataatgatcaatttttaaaccttctcaacaaacttttaato 



wo 98/59071 



PCT/US98/12718 



149 

TABLE 1. Nucleotide and Amino Acid Sequences 

GACTAGAAAGCCAAGAGAATACAAAGAAATTCTTGAGAGCATTAATAAATATGTTTTAGACGATAATATTCAGGTA 
TTATTTAGTGATCTTCGCAGAAATTTAAGATTATATAGATTTGCTGAGAGCTCTAAGATTCTTGAAGAGATTATTG 
AAATGCTTAATAATAAGAGATATTAG 

f527.aa 

imLLVKIAKFILILFLFTSCNQKQSEIQNLTHLLKSSNKNRLDKFLIIDRVVNIYIANKNYEDALEI^^ 
SREYYPLYLYIJyrGNIYDSMGEDFVAFNIYKRVVDNFDDYVYENHSMKTRVAKKIVNIJ^riDSIDKI^^^ 
DNLNNEEKGl^FYNIiALSLEDVQDYDESYFYYKKFLSIPRAHLKIDSRDYFNVWKINYFI^ 
DVKNFVLSGNTSKLLNI RDKNNFF IQSWDQKGGKSNS INTNS FLTTMI RIX5^ 
SRGVTOHIHEWYFVFKRIVYPKDPEINNGWTWIGVYIXSKK 

t527 .m 

CNQKQSEIQNLTHLLKSSNKNRLDKFLIIDRVVNIYIANKNYEDALEIVNNGIIDDESREYYPLYLYLM 

GEDFVAFNIYKRVVDNFDDYWENHSMKTRVAKKIVNLNIDSIDKINYYKFILNMGIDNI^ 

EDVQDYDESYFYYKKFLSIPRAHLKIDSRDYFNVVTKINYFNNPEFVVYRNLGDLIQDVKNW 

KlSnsrFFIQSWDQKGGKSNSINTNSFLTTMIRLGGRRKNGIQFAKHLEADSSDDISYLESRGWDHIHEWYFVFK^ 

PKDPEINNGWTWIGVYLGKK 

f527.nt 

ATGAATCTATTGGTCAAAATTGCTAAATTTATTTTGATTTTGTTTTTATTTACTTCTTGCAACCAAAAGCAAAGCG 

AGATTCAAAATCTTACACATCTTTTAAAATCTTCTAATAAAAATAGATTAGATAAATTTCTTATTATTGATAGAGT 

TGTTAACATATATATTGCAAATAAAAATTATGAAGATGCTTTAGAAATTGTAAATAATGGAATTATTGATGATGAA 

TCTAGAGAATATTATCCTTTGTATCTTTATTTAATGGGCAATATTTATGATTCCATGGGAGAAGATTTTGTAGCTT 

TTAATATTTACAAGCGTGTTGTTGATAATTTTGATGATTATGTTTATGAAAACCATTCAATGAAAACAAGGGTTGC 

TAAAAAGATTGTCAATTTAAATATTGATTCAATCGATAAAATCAATTATTACAAATTTATATTAAATATGGGGATT 

GATAATTTAAATAATGAGGAAAAGGGTAATTATTTTTATAATCTTGCGCTAAGTTTGGAAGATGTTCAAGATTACG 

ATGAATCTTATTTTTATTATAAAAAATTTCTTTCAATTCCAAGGGCACATTTAAAAATAGATTCTAGA 

TAATGTTGTTACAAAAATTAATTACTTTAATAATCCAGAGTTTGTTGTTTATAGAAATTTAGGAGATTTAATCCAG 

GATGTTAAAAATTTTGTTCTTTCTGGTAATACTTCTAAATTGCTTAATATAAGAGATAAGAATAATTTTTTTATTC 

AAAGCTGGGATCAAAAGGGTGGAAAGAGTAATTCCATTAATACTAATAGCTTTTTAACCACTATGATTAGGCTTGG 

GGGGAGAAGAAAAAACGGAATACAATTTGCAAAGCATCTTGAGGCAGATTCTAGTGACGATATATCTTATCTTGAG 

TCAAGGGGCTGGGACCATATTCATGAATGGTATTTTGTTTTTAAAAGAATTGTTTATCCTAAAGATCCAGAAATTA 

ATAATGGCTGGACTTGGATAGGCGTGTATTTAGGTAAAAAATAA 

t527.nt 

tgcaaccaaaagcaaagcgagattcaaaatcttacacatcttttaaaatcttctaataaaaatagattagataaat 

ttcttattattgatagagttgttaacatatatattgcaaataaaaattatgaagatgctttaga^ 

tggaattattgatgatgaatctagagaatattatcctttgtatctttatttaatgggcaatatttatc 

ggagaagattttgtagcttttaatatttacaagcgtgttgttgataattttgatgattatgtttatgaaaaccato 

caatgaaaacaagggttgctaaaaagattgtcaatttaaatattgattcaatcgataaaatcaatt^ 

tatattaaatatggggattgataatttaaataatgaggaaaagggtaattatttttataatcttgcg 

gaagatgttcaagattacgatgt^tcttatttttattataaaaaatttctttcaattccaagggcacatttaaaaa 

tagattctagagactattttaatgttgttacaaaaattaattactttaataatccagagtttgttgtttatagaaa 

tttaggagatttaatccaggatgttaaaaattttgttctttctggtaatacttctaaattgcttaatataagagat 

aagaataatttttttattcaaagctgggatcaaaagggtggaaagagtaattccattt^tactaatagcttttt;^ 

ccactatgattaggcttggggggagaagaaaaaacggaatacaatttgcaaagcatcttgaggcagattctagtga 

CGATATATCTTATCTTGAGTCAAGGGGCTGGGACCATATTCATGAATGGTATTTTGTTTTTAAAAGA^ 
CCTAAAGATCCAGAAATTAATAATGGCTGGACTTGGATAGGCGTGTATTTAGGTAAAAAATAA 

f541.aa 

M^mILLLILLESIVFLSCSGKGSLGSEIPKVSLIIIX3TFDDKSFNESArJ^GVKKVK£EFKIEL\^JK^ 
LEGLKDAGSDLIWLIGYRFSDVAKVAALQNPDMKYAIIDPIYSNDPIPANLVGMTFRAQEGAFLTGYIAAKLSKTG 
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KIGFLGGIEGEIVDAFRYGYEAGAKYANKDIKISTQYIGSFADLEAGRSVATRKnirSDEIDIIHHAAGLG^ 
AKELGSGHYI IGVDEDQAYIAPDNVITSTTKDVGRAIJ^I FTSNHLKTNTFEGGKLINYGLKEGWGFVR^ 
ELEKEXDNLSSKIINKEIIVPSNKESYEKFLKEFI 

t541.aa 

CSGKGSIXSSEIPKVSLIIDGTFDDKSFNESAIJSrGVKKVKEEFKIELVliKESSSNSYLSDLEGLKDAGSDLIWLIGY 

RFSDVAKVAALQNPDMKYAIIDPIYSNDPIPANLVGl^FRAQEGAFLTGYIAAKLSKTGKIGFLGGIEGEIVT^^ 

YGYEAGAKYANKDIKISTQYIGSFADLEAGRSVATRMYSDEIDIIHHAAGLGGIGAIEVAKELGSGHYIIGVDEDQ 

AYrJ!^PDWITSTTKDVGRALNIFTSNHLKTNTFEGGKLINYGLKEGWGFVRNPKMISFELEKEIDNLSSKIINK^ 

IIVPSNKESYEKFLKE 

FI 

f541.nt 

ATGAATAAAATATTGTTGTTGATTTTGCTTGAGAGTATTGTTTTTTTATCTTGTAGTGGTAAAGGTAGTCTTGGGA 

GCGAAATTCCTAAGGTATCTTTAATAATTGATGGAACTTTTGATGATAAATCTTTTAATGAGAGTGCTTTAAATGG 

CGTAAAAAAAGTTAAAGAAGAATTTAAAATTGAGCTTGTTTTAAAAGAATCCTCATCAAATTCTTATTTATCTGAT 

CTTGAAGGGCTTAAGGATGCGGGCTCAGATTTAATTTGGCTTATTGGGTATAGATTTAGCGATGTGGCCAAGGTTG 

CGGCTCTTCAAAATCCCGATATGAAATATGCAATTATTGATCCTATTTATTCTAACGATCCTATTCCTGCAAATTT 

GGTGGGCATGACCTTTAGAGCTCAAGAGGGTGCATTTTTAACGGGTTATATTGCTGCAAAACTTTCTAAAACAGGT 

AAAATTGGATTTTTAGGGGGAATAGAAGGCGAGATAGTAGATGCTTTTAGGTATGGGTATGAAGCTGGTGCTAAGT 

ATGCTAATAAAGATATAAAGATATCTACTCAGTATATTGGTAGTTTTGCTGACCTTGAAGCTGGTAGAAGCGTTGC 

AACTAGGATGTATTCTGATGAGATAGACATTATTCATCATGCTGCAGGCCTTGGAGGAATTGGGGCTATTGAGGTT 

GCAAAAGAACTTGGTTCTGGGCATTACATTATTGGAGTTGATGAAGATCAAGCATATCTTGCTCCTGACAATGTAA 

TAACATCTACAACTAAAGATGTTGGTAGAGCTTTAAATATTTTTACATCTAACCATTTAAAAACTAATACTTTCGA 

AGGTGGCAAATTAATAAATTATGGCCTTAAAGAAGGAGTTGTGGGGTTTGTAAGAAATCCTAAAATGATTTCCT^ 

GAACTTGAAAAAGAAATTGACAATCTTTCTAGCAAAATAATCAACAAAGAAATTATTGTTCCATCTAATAAAGAAA 

GTTATGAGAAGTTTCTTAAAGAATTTATTTAA 

t541.nt 

TGTAGTGGTAAAGGTAGTCTTGGGAGCGAAATTCCTAAGGTATCTTTAATAATTGATGGAACTTTTGATGATAAAT 

CTTTTAATGAGAGTGCTTTAAATGGCGTAAAAAAAGTTAAAGAAGAATTTAAAATTGAGCTTGTTTTAAAAGAATC 

CTCATCAAATTCTTATTTATCTGATCTTGAAGGGCTTAAGGATGCGGGCTCAGATTTAATTTGGCTTATTGGGTAT 

AGATTTAGCGATGTGGCCAAGGTTGCGGCTCTTCAAAATCCCGATATGAAATATGCAATTATTGATCCTATT^ 

CTAACGATCCTATTCCTGCAAATTTGGTGGGCATGACCTTTAGAGCTCAAGAGGGTGCATTTTTAACGGGTTATAT 

TGCTGCAAAACTTTCTAAAACAGGTAAAATTGGATTTTTAGGGGGAATAGAAGGCGAGATAGTAGA^^ 

TATGGGTATGAAGCTGGTGCTAAGTATGCTAATAAAGATATAAAGATATCTACTCAGTATATTGGTAGTTTTGCTG 

ACCTTGAAGCTGGTAGAAGCGTTGCAACTAGGATGTATTCTGATGAGATAGACATTATTCATCATGCTGCAGGCCT 

TGGAGGAATTGGGGCTATTGAGGTTGCAAAAGAACTTGGTTCTGGGCATTACATTATTGGAGTTG 

GCATATCTTGCTCCTGACAATGTAATAACATCTACAACTAAAGATGTTGGTAGAGCTTTAAATATTTTTACATC 

ACCATTTTUVAAACTAATACTTTCGAAGGTGGCAAATTAATAAATTATGGCCTTAAAGAAGGAGTTGTGG^ 

AAGAAATCCTAAAATGATTTCCTTTGAACTTGAAAAAGAAATTGACAATCTTTCTAGCAAAATAATCAAC^^ 

ATTATTGTTCCATCTAATAAAGAAAGTTATGAGAAGTTTCTTAAAGAATTTATTTAA 

f561.aa 

MYKNGFFKNYLSLFLIFLVIACTSKDSSNEYVEEQEAENSSKPDDSKIDEHTIGHVFHAMGVVHSKKDRKSLGKNI 
KVFYFSEEDGHFQTIPSKENAKLIVYFYDNVYAGEAPISISGKEAFIFVGITPDFKKIINSNLHGAKSDLIGTFKD 
U^IKNSKLEITVDENNSDAKTFLESVNYIIDGVEKISPMLTN 



tSei.aa 
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CTSKDSSNEYVEEQEAENSSKPDDSKIDEHTIGHVFHAMGVVHSKKDRKSI^KNIKVFYFSEEIX3HFQTIPSKE1^ 
KlilVYFYDNWAGEAPISISGKEAFIFVGITPDFKKIINSNLHGAKSDLIGTFKDLNIKNSKLEITTO 
FLESVNY 1 1 DGVEKI S PMLTN 

f561.nt 

ATGTATAAAAATGGTTTTTTTAAAAACTATTTGTCATTGTTTTTAATTTTTTTAGTAATTGCTTC 

ATAGCTCAAATGAATATGTTGAGGAGCAAGAAGCGGAGAACTCTTCTAAGCCTGATGATTCTAAAATAGATGAACA 

TACTATTGGGCACGTTTTTCACGCTATGGGAGTAGTTCATTCAAAAAAGGATCGAAAAAGTTTGGGGAAAAATATA 

AAGGTTTTTTATTTTTCTGAAGAAGATGGACATTTTCAAACAATACCCTCAAAAGAGAATGCAAAGTTAATAGTTT 

ATTTTTATGACAATGTTTATGCAGGAGAGGCTCCAATTAGTATCTCTGGAAAAGAAGCCTTTATTTTTGTTGGGAT 

TACCCCTGACTTTAAAAAGATTATAAATAGCAATTTACATGGCGCTAAAAGTGATCTTATTGGTACTTTTAAAGAT 

CTTAATATTAAAAATTCAAAATTGGAAATTACAGTTGATGAGAATT^TTCAGATGCCAAGACCTTCCTTGAATCTG 

TTAATTACATTATCGACGGCGTTGAAAAAATTTCACCTATGTTAACGAATTAA 

t561.nt 

TGTACTTCAAAAGATAGCTCAAATGAATATGTTGAGGAGCAAGAAGCGGAGAACTCTTCTAAGCCTGATGATTCTA 

AAATAGATGAACATACTATTGGGCACGTTTTTCACGCTATGGGAGTAGTTCATTCAAAAAAGGATCGAAAAAGTTT 

GGGGAAAAATATAAAGGTTTTTTATTTTTCTGAAGAAGATGGACATTTTCAAACAATACCCTCAAAAGAGAATGCA 

AAGTTAATAGTTTATTTTTATGACAATGTTTATGCAGGAGAGGCTCCAATTAGTATCTCTGGAAAAGAAGCCTTTA 

TTTTTGTTGGGATTACCCCTGACTTTAAAAAGATTATAAATAGCAATTTACATGGCGCTAAAAGTGATCTTATTGG 

TACTTTTAAAGATCTTAATATTAAAAATTCAAAATTGGAAATTACAGTTGATGAGAATAATTCAGATGCCA^ 

TTCCTTGAATCTGTTAATTACATTATCGACGGCGTTGAAAAAATTTCACCTATGTTAACGAATTAA 

f604.aa 

MSFNKTKKIGKKIKIVTLLMI^VSLIACNNWSEKEKLAFKVYIGGAPSSLDPHLVDETIGARILEQIFSGLLTLNT 
KTGKLKPGUUCNWEASKDKKTYQFYLRDNLFWSIXSVEITAEGIRKSFLRILNKETGSTK^ 

KVSDSELGIKAIDSKTLEITLTAPKPYFLELLLHYAFMPVPIHVIEKYKGNWTSPENMVTSGPFKLKKRLPNEKII 

FEKNERYYNAKEVELDELVYITSDNDLTVYNMYKimEIDAIFNSIPPDIVNEIKLQKDYYQHKS^^ 

KPLDDARWEALTI^IDRETLTYKVLNDGTVPTREITPDLKNYNYGKKIJ^FDPEKSKKLLADAGYPN 

LKYNTNETHKKIAAFIQNQWKKILNINLMLTNENWPVLTNSRNTGNFEIIRVGRIGEYLDPHTYFTIFTRENSQLA 

SYGYSNLEFDKLIRESDLEKDPIKRKQLLRKAESIIIEKDFPAAPIYIYSGHYLFRNDKWTGWNPNVSEVYYLSEL 

KPIKNAKHN 

t604 .aa 

CNNNSEKEKLAFKVYIGGAPSSLDPHLVDETIGARILEQIFSGLLTIJSrTKTGKL 

DNLFWSIX5VEITAEGIRKSFLRILNKETGSTNVDMLKSIIKNGQEYFIX5KVSDSELGIKAIDSKTLEITLTAPKPY 
FLELLLHYAFMPVPIHVIEKYKGNVirrSPENMVTSGPFKLKKRLPNEKIIFEKNERYYNAKEVELDEL^ 
TVYNMYKNNEIDAIFNSIPPDIVNEIKLQKDYYQHKSNAIYLYSFNTKIKPLDDARWEA^ 
DGTVPTREITPDLKNY^^!fGKKLALFDPEKSKKLIlADAGYPNGKGFPMLTLKYNT^^ 

LMLTNENWPVLTNSRNTGNFEIIRVGRIGEYLDPHTYFTIFTRENSQLASYGYSNLEFDKLIRESDLEKDPIKRKQ 
LLRKAES 1 1 1 EKDF PAAPI YI YSGHYLFRJ^KWTGWNPNVS EVYYLSELKP I KNAKHN 

f604.nt 

ATGAGCTTTAATAAAACTAAAAAAATCGGTAAAAAAATTAAAATAGTAACACTACTTATGCTTGCTGTGTCTTTAA 

TTGCATGCAATAATAATTCAGAAAAAGAAAAATTAGCATTTAAAGTATACATAGGGGGAGCGCCCTCATCGCT^ 

CCCTCATTTGGTAGATGAGACAATAGGAGCAAGAATTTTAGAACAAATATTCTCAGGGCTTTTGACATTA^ 

AAAACAGGAAAGCTAAAGCCCGGACTTGCTAAAAATTGGGAAGCCTCAAAAGATAAAAAAACATATCAATT 

TAAGGGACAACCTTTTTTGGAGCGATGGAGTTGAAATTACCGCTGAAGGGATAAGAAAATC 

AAATAAAGAAACAGGATCTACAAATGTTGACATGCTCAAATCAATAATAAAAAATGGACAAGAGTATT^ 

AAAGTATCCGATTCTGAACTTGGAATCAAGGCAATTGATAGTAAAACGCTGGAAATAACACTTACGGCC^ 

CATATTTTCTTGAACTGCTTCTACATTACGCATTCATGCCAGTACCTATTCATGT^ 
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TTGGACAAGCCCTGAAAACATGGTTACTAGCGGTCCTTTTAAATTAAAAAAAAGATTACCTAATGAAAAAATTATC 

TTTGAAAAAAACGAACGTTATTATAATGCAAAAGAAGTAGAACTTGATGAGCTTGTCTACATTACGTCTGACAATG 

ATCTTACTGTGTACAATATGTACAAAAACAACGAAATTGATGCTATTTTTAACAGCATCCCGCCGGACATTGTAAA 

TGAAATAAAACTACAAAAAGACTATTACCAACACAAAAGTAATGCAATTTATTTATATTCATTTAATACAAAAAT^ 

AAACCCCTTGATGATGCTAGAGTTAGAGAAGCTTTAACCTTAGCTATTGACAGAGAAACTTTAACTTACAAAGT^ 

TAAATGATGGCACAGTTCCTACAAGAGAAATAACTCCTGATCTTAAAAATTACAATTACGGTAAAAAATTGGCTTT 

ATTTGATCCTGAAAAATCTAAAAAGCTTTTGGCAGATGCAGGGTATCCTAATGGGAAAGGATTCCCAATC 

CTAAAATATAATACAAACGAAACTCATAAAAAAATTGCTGCATTTATTCAAAACCAATGGAAAAAAATTCTAAATA 

TCAATCTTATGCTTACCAACGAAAATTGGCCTGTTCTTACCAACAGCAGAAATACTGGCAATTTTGAAATAA 

AGTTGGACGCATTGGGGAATATTTAGATCCACACACATACTTTACTATATTCACAAGAGAAAATTCACAACTTGCA 

TCATACGGATATTCAAACCTAGAATTTGACAAACTCATCAGAGAATCAGATCTTGAAAAAGATCCTATAAAAAGAA 

AACAATTACTCAGAAAAGCAGAATCAATAATAATTGAAAAAGATTTTCCTGCTGCACCAATATACATATATTCTGG 

GCATTATCTTTTTAGAAACGATAAATGGACTGGATGGAATCCTAATGTATCAGAGGTTTATTATCTTTCTGAATTA 

AAACCAATTAAAAATGCAAAACATAATTAA 

t604.nt 

TGCAATAATAATTCAGAAAAAGAAAAATTAGCATTTAAAGTATACATAGGGGGAGCGCCCTCATCGCTTGACCCTC 
ATTTGGTAGATGAGACAATAGGAGCAAGAATTTTAGAACAAATATTCTCAGGGCTTTTGACATTAAATACCAAAAC 
AGGAAAGCTAAAGCCCGGACTTGCTAAAAATTGGGAAGCCTCAAAAGATAAAAAAACATATCAATTTTATCTAAGG 
GACAACCTTTTTTGGAGCGATGGAGTTGAAATTACCGCTGAAGGGATAAGAAAATCTTTTTTAAGAATTTTAAATA 
AAGAAACAGGATCTACAAATGTTGACATGCTCAAATCAATAATAAAAAATGGACAAGAGTATTTTGACGGGAAAGT 
ATCCGATTCTGAACTTGGAATCAAGGCAATTGATAGTAAAACGCTGGAAATAACACTTACGGCCCCAAAGCCATAT 
TTTCTTGAACTGCTTCTACATTACGCATTCATGCCAGTACCTATTCATGTGATTGAAAAATATAAGGGAAATTGGA 
CAAGCCCTGAAAACATGGTTACTAGCGGTCCTTTTAAATTAAAAAAAAGATTACCTJ\ATGAAAAAATTATCTTTGA 
AAAAAACGAACGTTATTATAATGCAAAAGAAGTAGAACTTGATGAGCTTGTCTACATTACGTCTGACAATGATCTT 
ACTGTGTACAATATGTACAAAAACAACGAAATTGATGCTATTTTTAACAGCATCCCGCCGGACATTGTAAATGAAA 
TAAAACTACAAAAAGACTATTACCAACACAAAAGTAATGCAATTTATTTATATTCATTTAATACAAAAATAAAACC 
CCTTGATGATGCTAGAGTTAGAGAAGCTTTAACCTTAGCTATTGACAGAGAAACTTTAACTTACAAAGTGCTAAAT 
GATGGCACAGTTCCTACAAGAGAAATAACTCCTGATCTTAAAAATTACAATTACGGTAAAAAATTGGCTTTATTTG 
ATCCTGAAAAATCTAAAAAGCTTTTGGCAGATGCAGGGTATCCTAATGGGAAAGGATTCCCAATGCTAACACTAAA 
ATATAATACAAACGAAACTCATAAAAAAATTGCTGCATTTATTCAAAACCAATGGAAAAAAATTCTAAATATCAAT 
CTTATGCTTACCAACGAAAATTGGCCTGTTCTTACCAACAGCAGAAATACTGGCAATTTTGAAATAATAAGAGTTG 
GACGCATTGGGGAATATTTAGATCCACACACATACTTTACTATATTCACAAGAGAAAATTCACAACTTGCATCATA 
CGGATATTCAAACCTAGAATTTGACAAACTCATCAGAGAATCAGATCTTGAAAAAGATCCTATAAAAAGAAAACAA 
TTACTCAGAAAAGCAGAATCAATAATAATTGAAAAAGATTTTCCTGCTGCACCAATATACATATATTCTGGGCATT 
ATCTTTTTAGAAACGATAAATGGACTGGATGGAATCCTAATGTATCAGAGGTTTATTATCTTTCTGAATTAAAACC 
AATTAAAAATGCAAAACATAATTAA 

f736.aa 

MKKVIILIFMLSTSLLYNCKNQDNEKIVSIGGSTWSPILDEMILRYNKINlSnsrTKV^ 

lAISSIUDLTKEEIEQGAKETVFAYDALIFITSPEIKITNITEENLAKILNGEIQNWKQVGGPDAKINFIK^ 

SYSSIKDLLLNKIFKTHEEAQFRQDGIVVKSNGEVIEKTSLTPHSIGYIGLiGYAKNSIEKGLNILSVNSTYPTKET 

INSNKYTIKRNLIIVTNNKYEDKSVTQFIDFMTSSTGQDIVEEQGFLGIKT 

t736 .aa 

CKNQDNEKIVSIGGSTTVSPILDEMILRYNKIiraNTKVTYDAQGSSVGINGLFNKIYKIAISSRDLTKEEIE 
ETVFAYDALIFITSPEIKITNITEENI^KILNGEIQNWKQVGGPDAKINFINRDSSSGSYSSIKDLLLNKIFKTHE 
EAQFRQTCIWKSNGEVIEKTSLTPHSIGYIGLGYAKNSIEKGLNILSWSTYPTKETINSI^YTIKiyS^ 
KYEDKSVTQFIDFMTSSTGQDIVEEQGFLGIKT 



f736.nt 
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ATGAAAAAAGTTATTATCTTAATTTTTATGCTATCAACAAGTTTATTATACAACTGTAAAAATCAAGACAATGAAA 

AAATTGTATCAATTGGAGGATCTACAACTGTAAGCCCAATACTAGACGAAATGATTTTAAGATATAATAAAATAAA 

CAATAATACTAAAGTAACATACGATGCACAAGGAAGTAGTGTTGGCATAAACGGGCTATTTAACAAAATATATA^ 

ATAGCAATATCATCAAGAGATTTAACAAAAGAAGAAATTGAACAAGGGGCAAAAGAAACTGTATTTGCTTATGATG 

CTTTAATTTTCATTACAAGCCCTGAAATAAAAATTACAAATATTACAGAAGAAAATCTAGCTAAAATACTAAATG^ 

AGAAATTCAAAATTGGAAACAAGTGGGAGGTCCTGATGCTAAAATCAACTTTATCAATCGAGACTCTTCTTCTG^ 

TCTTATTCGTCTATAAAAGACCTACTTCTTAATAAAATATTCAAAACTCACGAAGAAGCTCAATTTAGACAAGACG 

GAATAGTGGTAAAATCTAATGGAGAGGTAATTGAAAAAACAAGCCTTACTCCCCACTCAATAGGATATATAGGTCT 

TGGATACGCAAAAAATTCAATAGAAAAGGGTTTGAATATTCTTTCTGTTAACAGCACATATCCTACAAAAGAAACA 

ATAAATAGCAATAAATACACCATTAAAAGAAATTTAATAATAGTTACAAATAACAAATACGAGGATAAAAGCGTAA 

CTCAATTTATTGATTTCATGACAAGCTCAACTGGACAAGATATTGTTGAAGAACAAGGCTTTTTAGGGATAAAAAC 

ATAA 

t736.nt 

TGTAAAAATCAAGACAATGAAAAAATTGTATCAATTGGAGGATCTACAACTGTAAGCCCAATACTAGACGAAATGA 
TTTTAAGATATAATAAAATAAACAATAATACTAAAGTAACATACGATGCACAAGGAAGTAGTGTTGGCATAAACGG 
GCTATTTAACAAAATATATAAAATAGCAATATCATCAAGAGATTTAACAAAAGAAGAAATTGAACAAGGGGCAAAA 
GAAACTGTATTTGCTTATGATGCTTTAATTTTCATTACAAGCCCTGAAATAAAAATTACAAATATTACAGAAGAAA 
ATCTAGCTAAAATACTAAATGGAGAAATTCAAAATTGGAAACAAGTGGGAGGTCCTGATGCTAAAATCAACTTTAT 
CAATCGAGACTCTTCTTCTGGTTCTTATTCGTCTATAAAAGACCTACTTCTTAATAAAATATTCAAAACTCACGAA 
GAAGCTCAATTTAGACAAGACGGAATAGTGGTAAAATCTAATGGAGAGGTAATTGAAAAAACAAGCCTTACTCCCC 
ACTCAATAGGATATATAGGTCTTGGATACGCAAAAAATTCAATAGAAAAGGGTTTGAATATTCTTTCTGTTAACAG 
CACATATCCTACAAAAGAAACAATAAATAGCAATAAATACACCATTAAAAGAAATTTAATAATAGTTACAAATAAC 
AAATACGAGGATAAAAGCGTAACTCAATTTATTGATTTCATGACAAGCTCAACTGGACAAGATATTGTTGAAGAAC 
AAGGCTTTTTAGGGATAAAAACATAA 

f7 52 .aa 

MNKKIiNEVLLKLDQDLIKCVKGSLDLEISGVTYSSKLVLPRFVFFALPGIHFDGHDFIEIAIQKGSNVWCSRDVD 
FYSPNVTYIKVDDFNIRKFMSNFSNIFYDEPSKKLKVIGVTGTDGKSSVCYYIYLLFKKKGVKVGFISTVFFDDGS 
GSLIKNPYRQSTPESTEIHSFLSTMVKNEAQYAILESTSHGLDIiETARLIDVNYFAVVTTNIGHEHLEFHGTIQ]^ 
LNVKLGLFRSVSDDAGFGVINLDDLYSSDFKNAVKKSFTYSLKSSKADFFVSFIDEKTDSTRFEFYHKGVKYLAIW 
SLLGSFlSrVElSrmAAIilLVSQILNIDIQDIVDKI^CIKSLIXSRMDSINLGQNFSVIIDYAHTPG 

TNRLISVFGSAGERDVEKRFLQGQIADIYSDLIILCDEDPRGENSMCIIKDIAKGIVNKVENKDLFFIADRKQAIE 
KAISIiAKAGDLWALGKGHESSIIYKNREVFWNEQEVVKNAILSLEKSEKEK 

t752 .aa 

CVKGSIJDLEISGWYSSKLVLPRFWFALPGIHFDGHDFIEIAIQKGSimA^CSRDVDFYSPNVTY^ 

FMSNFSNIFYDEPSKKLKVIGVTGTDGKSSVCYYIYLLFKKKGVKVGFISTVFFDDGSGSLIKNPYRQSTPESTEI 

HSFLSTMVKNEAQYAILESTSHGLDLETARLIDVNYFAWFTNIGHEHLEFHGTIQOTUWKLGLFRSVSDDAG 

VINLDDLYSSDFKNAVKKSFTYSLKSSKADFFVSFIDEKTDSTRFEFYHKGVKYLAWSLLGSFNfVENW^^ 

SQILNIDIQDIVDKIiNCIKSLDGRMDSINLGQNFSVIIDYAHTPGAFSKLFPIFKRFATNRLISVFGSAGERDVEK 

RFLQGQIADIYSDLIILCDEDPRGENSMCIIKDIAKGIWKVENKDLFFIADRKQAIEKAISIJUCAGDLVVALGKG 

HESSI lYKNREVFWNEQEWKNAILSLEKSEKEK 

f752.nt 

ATGAATAAAAAACTTAATGAAGTTTTATTAAAGTTAGATCAAGATTTAATAAAATGTGTAAAAGGTTC 
TAGAAATATCAGGAGTTACTTATAGTTCTAAATTGGTTTTGCCCAGGTTTGTGTTTTTTGCTCTTCC^ 
TTTTGATGGGCATGATTTTATTGAAATTGCAATTCAAAAGGGTAGTAATGTTC 

TTTTACAGTCCTAATGTTACTTATATTAAGGTAGATGACTTTAACATAAGAAAATTTATGTCTAAT^ 
TTTTTTATGATGAGCCTTCAAAAAAATTAAAAGTTATTGGAGTCACIXXjCACTC 
TTATATATATCTTCTTTTTAAAAAAAAGGGTGTTAAAGTAGGTTTTATATCGACAGTATTTT^ 
GGAAGCTTGATTAAAAATCCTTACAGACAATCAACTCCCGAGTCTACGGAAATACATTCATTTTO 
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TTAAAAATGAAGCTCAATATGCAATTCTTGAATCTACTTCTCATGGGCTTGACCTTGAAACAGCAAGGCTTATTG 

TGTTAATTATTTTGCAGTTGTTTTTACCAATATTGGACATGAGCATCTTGAATTTCATGGCACAATTCAAAATTAT 

TTGAATGTCAAGCTGGGTCTTTTTCGGTCTGTTAGTGATGATGCTGGTTTTGGGGTTATTAATCTTGATGACCTTT 

ATTCTTCTGATTTTAAGAATGCTGTTAAGAAATCTTTTACTTATAGCTTAAAAAGCAGTAAAGCGGATTTTTTTGT 

TAGTTTTATTGATGAGAAAACCGATTCTACTAGATTTGAATTTTATCACAAGGGGGTTAAATATCTTGCTAATGTT 

AGCCTACTGGGGAGTTTTAATGTTGAGAATGTAATGGCTGCTCTTATTTTAGTTTCTCAAATTTTAAATATCGATA 

TTCAAGATATTGTTGATAAACTTAACTGCATTAAAAGTCTTGATGGGCGTATGGATAGTATTAATTTGGGGCi^^ 

TTTTTCTGTAATAATTGATTATGCTCATACTCCTGGTGCTTTTTCCAAGCTTTTTCCTATTTTTAAAAGATTTGCT 

ACCAATAGATTGATTTCTGTTTTTGGCTCTGCAGGAGAAAGAGATGTTGAAAAAAGATTTTTGCAAGGGCAAATCG 

CAGATATTTATTCTGATTTAATAATACTTTGCGATGAAGATCCAAGAGGCGAGAATAGTATGTGTATAATTAAAGA 

CATTGCAAAAGGAATTGTAAATAAAGTTGAAAATAAGGATTTATTTTTTATTGCTGATAGAAAGCAGGCTATTGAA 

AAAGCAATAAGTCTTGCAAAAGCAGGAGATTTGGTTGTTGCTTTGGGCAAAGGTCATGAAAGTTCAATAATTTATA 

AAAATAGAGAAGTTTTTTGGAATGAACAAGAGGTAGTTAAAAATGCTATTTTAAGTTTAGAAAAATCAGAAAAGGA 

GAAGTGA 

t752.nt 

TGTGTAAAAGGTTCTCTTGATTTAGAAATATCAGGAGTTACTTATAGTTCTAAATTGGTTTTGCCCAGGTTTGTGT 
TTTTTGCTCTTCCAGGAATTCATTTTGATGGGCATGATTTTATTGAAATTGCAATTCAAAAGGGTAGTAATGTTGT 
TGTGTGTTCACGAGATGTGGATTTTTACAGTCCTAATGTTACTTATATTAAGGTAGATGACTTTAACATAAGAAAA 
TTTATGTCTAATTTTTCAAATATTTTTTATGATGAGCCTTCAAAAAAATTAAAAGTTATTGGAGTCACTGGCACTG 
ACGGGAAAAGTTCTGTTTGTTATTATATATATCTTCTTTTTAAAAAAAAGGGTGTTAAAGTAGGTTTTATATCGAC 
AGTATTTTTTGATGATGGGAGTGGAAGCTTGATTAAAAATCCTTACAGACAATCAACTCCCGAGTCTACGGAAATA 
CATTCATTTTTAAGCACCATGGTTAAAAATGAAGCTCAATATGCAATTCTTGAATCTACTTCTCATGGGCTTGACC 
TTGAAACAGCAAGGCTTATTGATGTTAATTATTTTGCAGTTGTTTTTACCAATATTGGACATGAGCATCTTGAATT 
TCATGGCACAATTCAAAATTATTTGAATGTCAAGCTGGGTCTTTTTCGGTCTGTTAGTGATGATGCTGGTTTTGGG 
GTTATTAATCTTGATGACCTTTATTCTTCTGATTTTAAGAATGCTGTTAAGAAATCTTTTACTTATAGCTTAAAAA 
GCAGTAAAGCGGATTTTTTTGTTAGTTTTATTGATGAGAAAACCGATTCTACTAGATTTGAATTTTATCACAAGGG 
GGTTAAATATCTTGCTAATGTTAGCCTACTGGGGAGTTTTAATGTTGAGAATGTAATGGCTGCTCTTATTTTAGTT 
TCTCAAATTTTAAATATCGATATTCAAGATATTGTTGATAAACTTAACTGCATTAAAAGTCTTGATGGGCGTATGG 
ATAGTATTAATTTGGGGCAAAATTTTTCTGTAATAATTGATTATGCTCATACTCCTGGTGCTTTTTCCAAGCTTTT 
TCCTATTTTTAAAAGATTTGCTACCAATAGATTGATTTCTGTTTTTGGCTCTGCAGGAGAAAGAGATGTTGAAAAA 
AGATTTTTGCAAGGGCAAATCGCAGATATTTATTCTGATTTAATAATACTTTGCGATGAAGATCCAAGAGGCGAGA 
ATAGTATGTGTATAATTAAAGACATTGCAAAAGGAATTGTAAATAAAGTTGAAAATAAGGATTTATTTTTTATTGC 
TGATAGAAAGCAGGCTATTGAAAAAGCAATAAGTCTTGCAAAAGCAGGAGATTTGGTTGTTGCTTTGGGCAAAGGT 
CATGAAAGTTCAATAATTTATAAAAATAGAGAAGTTTTTTGGAATGAACAAGAGGTAGTTAAAAATGCTATTTTAA 
GTTTAGAAAAATCAGAAAAGGAGAAGTGA 

f798.aa 

MVFRTYKHLELIMLPMLMLSCAFFKKPQSVHQDSNTGKPISDEKLHLISGKISNKKLPIINSNHDVTWIKTKAMTI 
IiGEIX3KEIPEFKNKFGYSYIISPVKMIX3KYSYYASLLILFETTKNGDDEyEIEDVKFVTAG 
QEEGYWAYPFGILMSDEIKNAFKLTYKNGHWrmiLADLTVKNKLTQETKIYKISLNSKLIIEFLKE 
DIAGDLFEDI 

t798 .aa 

CAFFKKPQSVHQDSNTGKPISDEKLHLISGKISNKKLPIINSNHDVTWIKTKAMTILGEDGKEIPEFKNKFGYSYI 

ISPVKMIXSKYSYYASLLILFETTKNGDDEYEIEDVKFWAGSTLELKNSLI^VENSQEEGYVTAYPFGILMS 

NAFKLTYKNGHWlOYMLADLTVKNKLTQETKIYKISLJSrSKLIIEFLKEVLKENSILKDIAGDLFEDI 

f798.nt 

ATGGTATTTAGAACATATAAACATTTGGAACTAATAATGCTGCCCATGTTAATGCTGAGTTGCGCTTTTTTTAAGA 
AACCACAATCTGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTACATTTAATATCAGGCAA 
AATTTCAAATAAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAACAAAGGCAATC 
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TTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATAATATCTCCTGTAAAAA 
TGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATGGAGATGATGAATATGA 
AATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTTAGCTGTTGAAAATTCA 
CAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAAAATGCTTTTAAATTAA 
CATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTACTCAAGAAACTAAAAT 
TTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGAAAATTCTATATTAAAA 
GACATAGCTGGAGATTTATTTGAAGATATATAA 

t798 .nt 

TGCGCTTTTTTTAAGAAACCACAATCTGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTAC 
ATTTAATATCAGGCAAAATTTCAAATAAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAAC 
AAAGGCAATGACAATCTTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATA 
ATATCTCCTGTAAAAATGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATG 
GAGATGATGAATATGAAATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTT 
AGCTGTTGAAAATTCACAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAA 
AATGCTTTTAAATTAACATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTA 
CTCAAGAAACTAAAATTTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGA 
AAATTCTATATTAAAAGACATAGCTGGAGATTTATTTGAAGATATATAA 

fSOS.aa 

MLRKLKDISKIVLVTDGLTPNCQTCGKLIANGDEVYIAEDGLFHSVKSNTIAGSTLTMIQGLKNLIEFGFSLSDAV 
QASSYNPTRILNIDKKGLICHGYDANLNVLDKDFNLKLTMIESKIIFNNL 

tSOS.aa 

CQTCGKLIANGDEVYIAEDGLFHSVKSNTIAGSTLTMIQGLKNLIEFGFSLSDAVQASSYNPTRILNIDKKGLICH 
GYDANLNVLDKDFNLKLTMIESKI IFNNL 

fSOS.nt 

ATGCTTAGAAAGCTTAAAGATATAAGTAAAATAGTCCTTGTAACTGACGGACTTACTCCGAATTGTCAAACTTGTC 
GAAAACTAATTGCAAACGGAGACGAAGTTTATATTGCAGAAGATGGATTATTCCATAGCGTGAAAAGCAACACAAT 
AGCTGGATCAACACTCACAATGATACAAGGTCTTAAAAATTTAATAGAATTTGGTTTCAGCTTAAGCGATGCTGTT 
CAAGCAAGCTCTTACAATCCAACAAGAATTCTCAATATTGATAAAAAGGGCTTAATATGTCATGGATATGATGCi^ 
ACCTCAATGTCCTAGATAAAGATTTTAATCTAAAGTTAACAATGATAGAATCTAAAATAATTTTTAACAATCTCTA 
A 

tSOS.nt 

TGTCAAACTTGTGGAAAACTAATTGCAAACGGAGACGAAGTTTATATTGCAGAAGATGGATTATTCCATAGCGTGA 

AAAGCAACACAATAGCTGGATCAACACTCACAATGATACAAGGTCTTAAAAATTTAATAGAATTTGGTTTC 

AAGCGATGCTGTTCAAGCAAGCTCTTACAATCCAACAAGAATTCTCAATATTGATAAAAAGGGCTTAATATGTCAT 

GGATATGATGCAAACCTCAATGTCCTAGATAAAGATTTTAATCTAAAGTTAACAATGATAGAATCTAAAATAATTO 

TTAACAATCTCTAA 

f635.aa 

MKILWLIILVNLFLSCGNESKEKSNLGLRLRELEISGGGSESKIEWKEFIEKEDKNILKIVNSIDKKARFFNLIG 

LEFFKI^QYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAKNLRVKDEVEKYIIL^^ 

AIS^^^YWDLDKQLEAKNYLNKLGDMGEDYFEFLMLRGAIm^S 

LK 



t635.aa 
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CGNESKEKSNLGLRLRELEISGGGSESKIEVYKEFIEKEDKNILKIVNSIDKKARFFNLIGLEFFKLGQYGPAIEY 

FAKNLEINP^^S^fLSHFYIGVASYNLAiaSILRVKDEVEKYIIIJlENSFLKSLSIP^ 

AKNYIJ^LGDMGEDYFEFLMIiRGAlSrYYSLGDLGNAILFYDKASKKASTEEQKEGVSR 

£635. nt 

atgaaaattttgtggttaataattcttgttaatttatttttatcttgtggcaatgaatctaaagaaaaatca;^ 

TTGGTCTTAGATTAAGAGAATTGGAAATTTCAGGTGGTGGATCTGAATCTAAGATTGAAGTTTATAAAGAATTTAT 

tgaaaaagaagataagaatattttaaagatagttaattccattgataagaaagccagattttttaatttaattggt 

CTTGAATTTTTTAAGCTTGGTCAGTACGGACCTGCTATTGAATATTTTGCTAAAAATTTAGAAATCAATCCCAATA 

attatttatctcatttttatataggtgttgcttcttataatttagctaaaaatttaagagtaaaagatgaagttga 

AAAATACATAATTCTTGCTGAAAATTCTTTTTTAAAATCACTTTCAATTAGAGATGATTTTAAAGATTCTCTTTTT 

gccatttctaatatgtacgtatatgatcttgataaacaacttgaagctaaaaattatttaaataaacttggtgata 

TGGGTGAGGACTATTTTGAGTTTTTAATGTTAAGAGGTGCAAATTATTATTCGCTGGGCGATCTTGGTAATGCTAT 

attgttttatgataaagctagtaaaaaggcttcaactgaagagcaaaaagaaggtgtttctaggatcatgagtaat 
ttgaagtaa 

t635.nt 

TGTGGCAATGAATCTAAAGAAAAATCAAATCTTGGTCTTAGATTAAGAGAATTGGAAATTTCAGGTGGTGGATCTG 
AATCTAAGATTGAAGTTTATAAAGAATTTATTGAAAAAGAAGATAAGAATATTTTAAAGATAGTTAATTCCATTGA 
TAAGAAAGCCAGATTTTTTAATTTAATTGGTCTTGAATTTTTTAAGCTTGGTCAGTACGGACCTGCTATTGAATAT 
TTTGCTAAAAATTTAGAAATCAATCCCAATAATTATTTATCTCATTTTTATATAGGTGTTGCTTCTTATAATTTAG 
CTAAAAATTTAAGAGTAAAAGATGAAGTTGAAAAATACATAATTCTTGCTGAAAATTCTTTTTTAAAATCACTTTC 
AATTAGAGATGATTTTAAAGATTCTCTTTTTGCCATTTCTAATATGTACGTATATGATCTTGATAAACAACTTGAA 
GCTAAAAATTATTTAAATAAACTTGGTGATATGGGTGAGGACTATTTTGAGTTTTTAATGTTAAGAGGTGCAAATT 
ATTATTCGCTGGGCGATCTTGGTAATGCTATATTGTTTTATGATAAAGCTAGTAAAAAGGCTTCAACTGAAGAGCA 
AAAAGAAGGTGTTTCTAGGATCATGAGTAATTTGAAGTAA 

f314.aa 

MNNCLIKFFIFLLVFSNSYVAFSKNVNVLIVTAMDSEFTOINKLMSNKEEIVLKEYGLNKKILKGKLSNRW^^ 
CGVGKVNAGVWTSYILSKYNISHVINSGVAGGWSAKYKDIKVGDVWSSEVAYHDVDLTKFGYKVGQLTGGLPQK 
FNANKNLIKNAIEAIKSKVGGSNAYSGLIVSGDQFIDPTYINKIIGNFKDVIAVEMEGAAIGHVSHMFNIPFIVIR 
SISDIVNKEGNEVEYSKFSKIAAFNSAKWQEILRKLZ 

t314.aa 

KNVIWLI VTAMDS EFDQINKLMSNKEEI VLKEYGLNKKI LKGKLSNRNVWI ICGVGKVNAG^ 
VINSGVAGGWSAKYKDIKVGDVVVSSEVAYHDVDLTKFGYKVGQLTGGLPQKFNANKNLIKNAIEAIKSKVGGSN 
AYSGLIVSGDQFIDPTYINKIIGNFKDVIAVEMEGAAIGHVSHMFNIPFIVIRSISDIVNKEGNEVEYSKFSKIAA 
FNSAKWQEILRKLZ 

f314.nt 

ATGAATAATTGTTTAATAAAGTTTTTTATTTTTTTATTAGTTTTTTCAAACAGTTATGTTGCTTTTTCTAAA^ 

tcaatgttttaatagtaactgctatggactctgagtttgatcagataaataagcttatgtctaataaggaagaaat 

agttcttaaggagtatggtcttaataaaaagattttaaaggggaagttgtctaatcgcaatgttatggttat^ 

tgtggggttggtaaggttaatgctggtgtgtggactagctacattttgtcaaaatacaacataagtcatgtcatta 

attctggcgttgctggtggcgttgttagtgctaaatacaaagatattaaagtgggagatgtggtggtgtcttcaga 

ggttgcatatcatgatgttgatttgactaaatttggatacaaggtaggacagcttacaggaggattgcctcaaaaa 

tttaatgccaataaaaatttaattaagaatgccatagaggccattaaatcaaaggttggaggttctaatgcatatt 

caggattaatagtttcaggagatcagtttattgatcct^cttatattaacaaaattataggaaacttta;^ 

aatagctgttgagatggaaggtgcagcaatagggcatgtttctcatatgtttaatataccttot 

tcaatatctgacattgtaaataaagaagggaatgaggttgaata.tagtaaattttcti^^ 

cagccaaagttgtacaagaaattttaagaaaactttaa 
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t314.nt 

AAAAATGTCAATGTTTTAATAGTAACTGCTATGGACTCTGAGTTTGATCAGATAAATAAGCTTATGTCTAATAAGG 

AAGAAATAGTTCTTAAGGAGTATGGTCTTAATAAAAAGATTTTAAAGGGGAAGTTGTCTAATCGCAATGTTATGGT 

TATTATTTGTGGGGTTGGTAAGGTTAATGCTGGTGTGTGGACTAGCTACATTTTGTCAAAATACAA^ 

GTCATTAATTCTGGCGTTGCTGGTGGCGTTGTTAGTGCTAAATACAAAGATATTAAAGTGGGAGATGTGGTGGTO 

CTTCAGAGGTTGCATATCATGATGTTGATTTGACTAAATTTGGATACAAGGTAGGACAGCTTACAGGAGGATTGCC 

TCAAAAATTTAATGCCAATAAAAATTTAATTAAGAATGCCATAGAGGCCATTAAATCAAAGGTTGGAGGTTCTAAT 

GCATATTCAGGATTAATAGTTTCAGGAGATCAGTTTATTGATCCAACTTATATTAACAAAATTATAGGAAACTTTA 

AAGATGTAATAGCTGTTGAGATGGAAGGTGCAGCAATAGGGCATGTTTCTCATATGTTTAATATACCTTTTATAGT 

TATTAGGTCAATATCTGACATTGTAAATAT^GAAGGGAATGAGGTTGAATATAGTAAATTTTCTAAAATAGCTGCT 

TTCAATTCAGCCAAAGTTGTACAAGAAATTTTAAGAAAACTTTAA 

f32.aa 

MNTKTLYLISLILLACNKNNKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEVV 
KIEKTLEKTERYGIEGNWILVNYKGTKRYIFSKDINIVNNLIIDHSKZ 

t32 . aa 

CNKNNKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEWKIEKTLEKTERYGIE 
GNWILVNYKGTKRYIFSKDINIVNNLIIDHSKZ 

f32 .nt 

ATGAATACAAAAACATTATATTTAATATCCTTAATTCTTTTAGCTTGCAATAAAAATAACAAAATTCCTCTCATTC 
AAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATAAAATGGGCATAATAATAAAAGATTATGCTTT 
TCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTACGCAATTCTACTCAGAAAAGACGAAGTCGTA 
AAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAAGGAAATTGGATCCTAGTCAATTACAAGGGAA 
CTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATTTAATAATTGATCATTCTAAATAG 

t32 .nt 

TGCAATAAAAATAACAAAATTCCTCTCATTCAAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATA 
AAATGGGCATAATAATAAAAGATTATGCTTTTCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATT^ 
CGCAATTCTACTCAGAAAAGACGAAGTCGTAAAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAA 
GGAAATTGGATCCTAGTCAATTACAAGGGAACTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATT 
TAATAATTGATCATTCTAAATAG 

f320.aa 

MKSIYALLFLFINLSLLANNISKKDLEVLLKIAQAMNKECKNFIEKNPIQFLKEIKPLVDAEK^ 
PENYKIPDLVNIDDFEDLKNLGAKTIKVRKILIEDLIRLIKDAKKFGIEIKIKSAYRTQEYQKFLFDYNVKTYGRK 
VAETQSAIPGHSQHHMGTAIDFINIDDNLLNTKEGKWLYENSLKYGFSVSYPKGYETDTGYKAEPWHYLYIGPKPC 
FIQKKYFNNLQHKLLEFWNQNKTNLINLIEKYAN2 

t320.aa 

^^S^ISKiCDLEVLLKIAQAMNKECKNFIEKNPIQFLKEIKPLVDAEKNNLLTLINKKIPIPENYK 

KNLGAKTIKWKILIEDLIRLIKDAKKFGIEIKIKSAYRTQEYQKFLFDYNVKTYGRKVAETQSAIPGHSQHHMGT 
AIDFINIDDm.IJSPrKEGKWLYENSLKYGFSVSYPKGYETDTGYKAEPVfflYLYIGPKPCFIQKKYFN^ 
NQNKTNL INL I EKYANZ 



f320.nt 
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ATGAAATCAATTTATGCTTTATTATTTCTATTTATTAATTTATCTTTGTTGGCTAACAACATTTCAAAAAAAGATT 

TAGAAGTACTGCTAAAGATTGCCCAAGCAATGAATAAGGAATGCAAAAATTTTATTGAAAAAAATCCT^ 

CTTAAAAGAAATAAAACCCTTAGTAGATGCAGAAAAAAATAACCTCTTAACTCTAATAAATAAAAAAATACCAATT 

CCTGAAAATTATAAAATACCTGATCTGGTAAATATTGATGATTTTGAAGATCTTAAAAATCTTGGAGCA7VAGACTA 

TTAAAGTAAGAAAAATATTAATCGAAGATTTAATTCGACTAATAAAAGATGCAAAAAAATTTGGGATTGAAATTAA 

AATCAAATCTGCTTACAGAACGCAAGAATATCAAAAATTTTTATTTGATTACAATGTCAAAACTTATGG^ 

GTTGCAGAAACCCAATCAGCAATTCCAGGCCATTCTCAACATCATATGGGAACAGCAATAGATTTTATAAATATAG 

ATGATAATTTACTAAACACAAAAGAAGGAAAATGGCTTTATGAAAACTCTCTAAAATACGGATTTTCCGTTTCATA 

CCCAAAAGGATATGAAACGGACACTGGATATAAAGCAGAGCCTTGGCACTACTTATACATAGGACCTAAGCCATGC 

TTTATTCAGAAAAAATATTTTAATAATTTACAACATAAGCTTCTTGAATTTTGGAACCAGAACAAAACAAATCTTA 

TTAACCTAATTGAAAAATATGCAAACTAA 

t320 .nt 

AACAACATTTCAAAAAAAGATTTAGAAGTACTGCTAAAGATTGCCCAAGCAATGAATAAGGAATGCAAAAATTTTA 
TTGAAAAAAATCCTATTCAGTTCTTAAAAGAAATAAAACCCTTAGTAGATGCAGAAAAAAATAACCTCTTAACTCT 
AATAAATAAAAAAATACCAATTCCTGAAAATTATAAAATACCTGATCTGGTAAATATTGATGATTTTGAAGATCTT 
AAAAATCTTGGAGCAAAGACTATTAAAGTAAGAAAAATATTAATCGAAGATTTAATTCGACTAATAAAAGATGCAA 
AAAAATTTGGGATTGAAATTAAAATCAAATCTGCTTACAGAACGCAAGAATATCAAAAATTTTTATTTGATTACAA 
TGTCAAAACTTATGGCAGAAAAGTTGCAGAAACCCAATCAGCAATTCCAGGCCATTCTCAACATCATATGGGAACA 
GCAATAGATTTTATAAATATAGATGATAATTTACTAAACACAAAAGAAGGAAAATGGCTTTATGAAAACTCTCTAA 
AATACGGATTTTCCGTTTCATACCCAAAAGGATATGAAACGGACACTGGATATAAAGCAGAGCCTTGGCACTACTT 
ATACATAGGACCTAAGCCATGCTTTATTCAGAAAAAATATTTTAATAATTTACAACATAAGCTTCTTGAATTTTGG 
AACCAGAACAAAACAAATCTTATTAACCTAATTGAAAAATATGCAAACTAA 

f342 .aa 

MLYLGDNKAMRTKIIIMTIIILLAPISGFSNSKESARGKFGAGIILPLPIALQINIGNFDLDIGLYSGVNNLFSDW 
KTLFIALDYIFYIYTFPGAANILDFSVGAGGYGTIWFSRFGGSKSGSGPMSIGARLPLALNIAVFRKKFDIFLRIA 
PGLGMNVWSNGVGFRWEVF AGLGLRF WFT Z 

t342 .aa 

LAPISGFSNSKESARGKFGAGIILPLPIALQINIGNFDLDIGLYSGVNNLFSDWKTLFIALDYIFYIYTFPGAANI 

LDFSVGAGGYGTIWFSRFGGSKSGSGPMSIGARLPLAIJ^IAVFRKKFDIFLRIAPGLGM^^WSNGVGF 

GLRFWFTZ 

f342.nt 

ATGCTATACTTAGGAGATAATAAAGCAATGAGAACAAAAATAATTATTATGACAATTATTATTTTATTAGCCCCAA 

TCTCAGGATTTTCTAATTCAAAAGAATCTGCAAGGGGTAAATTTGGAGCAGGAATTATACTTCCATTACCAAT^ 

TCTACAGATTAATATAGGAAACTTTGATCTTGACATTGGTCTTTACAGCGGAGTAAATAATTTGTTTTCAGACTGG 

AAAACATTATTTATAGCATTAGACTATATTTTCTACATATACACATTCCCGGGAGCTGCTAATATTTTGGATTTTT 

CAGTTGGCGCAGGGGGATATGGAACAATATGGTTTTCAAGATTTGGAGGCAGTAAGTCAGGCTCAGGACCAATGAG 

CATTGGAGCAAGATTGCCTTTGGCCTTAAATATTGCAGTATTTAGGAAGAAATTCGACATATTTTTACGAATAGCA 

CCCGGACTTGGAATGAATGTTTGGAGTAATGGCGTTGGATTTAGATGGGAAGTATTCGCAGGATTGGGACTAAGAT 

TCTGGTTTACTTAA 

t342.nt 

TTAGCCCCAATCTCAGGATTTTCTAATTCAAAAGAATCTGCAAGGGGTAAATTTGGAGCAGGAATTATACTTCCAT 

TACCAATTGCTCTACAGATTAATATAGGAAACTTTGATCTTGACATTGGTCTTTACAGCGGAGTAAATAATTTGTT 

TTCAGACTGGAAAACATTATTTATAGCATTAGACTATATTTTCTACATATACACATTCCCGGGAGCTGCTAATATT 

TTGGATTTTTCAGTTGGCGCAGGGGGATATGGAACAATATGGTTTTCAAGATTTGGAG^ 

GACCAATGAGCATTGGAGCAAGATTGCCTTTGGCCTTAAATATTGCAGTATTTAGGAAGAAATTCGAC^^ 
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ACGAATAGCACCCGGACTTGGAATGAATGTTTGGAGTAATGGCGTTGGATTTAGATGGGAAGTATTCGCAGGATTG 
GGACTAAGATTCTGGTTTACTTAA 

f352.aa 

MNKTKNRSLTYFIILSCISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSKHLNKNIVSYE 
KLPENIRDKKLPQKRMDENDLKSVIENYENKIKNIEKLLKTKNQKTSENENKKIESIEKKAKKYEILTNKLK^ 
EIKKLLNKKIKPKEDENYEKINIENIEEETDDDFEDNYEYNDEIEEQMRTITLLMKEZ . 

t352 .aa 

CISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSKHLNKNIVSYEDPKKGKDLKLPENIRDKKLPQKR^ 
DENDLKSVIENYENKIKNIEKLLKTKNQKTSENENKKIESIEKKAKKYEILTNKLKNEIVEIKKLLNKKIKPKEDE 
NYEKINIENIEEETDDDFEDNYEYNDEIEEQMRTITLLMKEZ 

f352 .nt 

ATGAATAAAACAAAAAATCGAAGCCTTACGTATTTTATAATACTTTCATGTATATCATTATTTGGGGCTAATAATA 
ATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTGAAGAATTTAAAAGTTCTGGGAATAAAAG 
CGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTATGAAGACCCAAAAAAGGGTAAAGATCTA 
AAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATGGACGAAAATGATCTAAAATCTGTAATTC 
AAAATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAAAAACCAAAAATCAAAAAACATCGGAAAATGAAAA 
TAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAATTTTAACCAATAAATTAAAAAACGAAATAGTA 
GAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATGAAAATTACGAAAAAATAAATATTGAAAACA 
TTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATGATGAAATTGAAGAACAAATGAGGACAAT 
TACCCTTCTAATGAAGGAATAA 

t352 .nt 

TGTATATCATTATTTGGGGCTAATAATAATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTG 

AAGAATTTAAAAGTTCTGGGAATAAAAGCGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTA 

TGAAGACCCAAAAAAGGGTAAAGATCTAAAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATG 

GACGAAAATGATCTAAAATCTGTAATTGAAAATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAAA^ 

AAAATCAAAAAACATCGGAAAATGAAAATAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAA^ 

AACCAATAAATTAAAAAACGAAATAGTAGAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATG^ 

AATTACGAAAAAATAAATATTGAAAACATTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATG 

ATGAAATTGAAGAACAAATGAGGACAATTACCCTTCTAATGAAGGAATAA 

£301. aa 

MQIDGKIYSIISFPVRDSVSTLGVIGILICFDESLDIIENQLYSSLKFGSKNYNFFMLDRNYMPIFSNLIS^ 

FSTAYSENFLSKVIAYAKKDSSSSQYTFNYERDFYSLNFVKTDDFLTQGLIIJWNSIPIMFKSI^ 

AIIFYLCNTFVFSLINDFNRIVDYQKSKSDPFSLESPLEVKYSSSIISYISSKLDNLSSKSNESFEKIKFYSEDLN 

EYLEQIETAISNTESIDSSILVYEQLRDTFSRFEKSIVDILKGFESIADPINDHNKYISEISSNFEESVSFFYSID 

K^^JEIFNKVATINSTDIENIKSKVFDLNIVFENV^rKNFADLLSQTNSLQSVNKLLVSISAQTl^ 

GDAGKSFAWAEEIRKIAINSGKYSKTIKI)ELKTVDSIIAVINSEII>riYKNFIDIQDIWDNNFSRHEK\^ 

HFKEIGEFKERYLSHDTKIRDAKNMYKEIFNNHYFISGKFI^FSQDLKEFKVSKMNLDAVSS 

ILKTKELIQKINDEIKDILFZ 

t301.aa 

CFDESUDIIENQLYSSLKFGSKNYWFFMLDRimiPIFSNIJraLQAKSFSTAYSENF 
YEIU:>FYSLNFVKTDDFLTQGLILNVNSIPIMFKSNWIFVAFLLLSFAIIFYLCNTFVFSLIND 
DPFSLESPLEVKYSSSIISYISSKLDNLSSKSNESFEKIKFYSEDLNEYLEQIETAISNTESIDSSILVYEQLRDT 
FSRFEKSIVDILKGFESIADPINDHNKYISEISSNFEESVSFFYSIDKNLEIFNKVATINSTDIENIKSKVFDIJ^ 
VFENVNKNFADLLSQTNSLQS WKLLVS I S AQTNMIAMNAAI EAAK^ 
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DELKTVDSIIAVINSEIDTIYKNFIDIQDNVDNNFSRHEKVDLTLAKHFKEIGEFKERYLSHDTKIRDAK^^ 
FNNHYFISGKFNNFSQDLKEFKVSKMNLDAVSSLQEYSSLVKSSKDKILKTKELIQKINDEIKDILFZ 

£301. nt 

ATGCAAATAGATGGGAAAATTTATTCTATAATAAGTTTTCCAGTTAGAGATTCTGTTTCAACATTGGGTGTGATAG 
GGATTTTAATATGCTTTGATGAGTCGTTAGATATTATTGAAAATCAGTTGTATTCTTCTCTTAAATTTGGTAGTAA 
AAATTATAATTTTTTTATGCTTGACAGAAATTACATGCCCATTTTTTCAAACCTTAATAATCTTCAGGCCAAATCT 
TTTTCTACAGCTTATAGTGAGAATTTTTTGAGTAAAGTTATAGCTTATGCTAAAAAAGATTCTTCTAGCTCTCAGT 
ACACTTTTAATTATGAAAGAGATTTTTATTCTTTAAACTTTGTAAAAACCGATGATTTTTTGACTCAGGGGCTTAT 
TTTAAATGTCAATTCCATTCCTATTATGTTTAAATCAAATTGGGTTATATTTGTTGCATTTTTATTATTGTCTTTT 
GCAATTATTTTTTATTTATGCAATACTTTTGTTTTTTCATTAATTAATGATTTTAACAGAATTGTTGACTATCAAA 
AATCAAAAAGCGATCCTTTTAGTCTTGAATCTCCCTTAGAGGTTAAGTATTCTTCATCTATTATTTCTTATATTAG 
TTCAAAGCTAGATAATCTGTCTTCTAAGAGTAATGAATCTTTTGAGAAGATAAAATTTTATTCTGAAGATTTGAAT 
GAATATTTGGAACAAATAGAAACTGCTATATCAAATACTGAGAGTATAGATTCTAGCATTTTAGTTTACGAACAAC 
TAAGAGATACTTTTTCTAGATTTGAAAAATCAATTGTTGATATTTTAAAAGGCTTTGAATCTATTGCTGATCCGAT 
TAATGATCACAATAAATATATATCAGAAATCTCTTCAAATTTTGAAGAGAGTGTTAGTTTTTTCTATAGTATAGAT 
AAAAATTTAGAAATTTTTAATAAGGTTGCTACTATAAATTCTACTGATATTGAAAATATTAAAAGTAAGGTTTTTG 
ATTTAAATATTGTTTTTGAAAATGTGAATAAAAATTTTGCAGATCTTTTGTCTCAAACAAATAGTTTGCAAAGTGT 
AAATAAACTTTTAGTTTCAATTTCAGCTCAGACCAATATGCTTGCTATGAATGCAGCAATTGAAGCAGCAAAAGCA 
GGTGATGCAGGTAAAAGTTTTGCAGTTGTTGCTGAGGAGATTAGAAAGCTTGCTATTAATTCTGGAAAATATTCTA 
AAACCATTAAAGATGAACTTAAAACGGTCGACAGCATTATTGCAGTAATTAATTCAGAGATTGATACAATTTATAA 
AAATTTCATAGACATTCAAGATAATGTGGACAACAATTTTTCAAGACACGAGAAAGTAGATCTTACTCTTGCTAAG 
CATTTTAAAGAAATTGGCGAGTTTAAAGAAAGGTATTTGTCTCACGATACTAAGATCAGAGATGCTAAGAATATGT 
ATAAAGAAATATTTAATAATCATTATTTTATTAGTGGCAAGTTTAACAACTTTAGTCAAGATTTAAAAGAGTTTAA 
AGTTTCTAAGATGAATTTAGATGCGGTAAGTTCTCTTCAAGAATATTCATCTTTAGTAAAGTCTTCTAAGGATAAG 
ATATTAAAGACAAAGGAATTGATTCAAAAGATTAATGATGAGATTAAAGATATTCTTTTTTAG 

t301.nt 

TGCTTTGATGAGTCGTTAGATATTATTGAAAATCAGTTGTATTCTTCTCTTAAATTTGGTAGTAAAAATTATAATT 

TTTTTATGCTTGACAGAAATTACATGCCCATTTTTTCAAACCTTAATAATCTTCAGGCCAAATCTTTTTCTACAGC 

TTATAGTGAGAATTTTTTGAGTAAAGTTATAGCTTATGCTAAAAAAGATTCTTCTAGCTCTCAGTACACTTTTAAT 

TATGAAAGAGATTTTTATTCTTTAAACTTTGTAAAAACCGATGATTTTTTGACTCAGGGGCTTATTTTAAATGTCA 

ATTCCATTCCTATTATGTTTAAATCAAATTGGGTTATATTTGTTGCATTTTTATTATTGTCTTTTGCAATTATTTT 

TTATTTATGCAATACTTTTGTTTTTTCATTAATTAATGATTTTAACAGAATTGTTGACTATCAAAAATCAAAAAGC 

GATCCTTTTAGTCTTGAATCTCCCTTAGAGGTTAAGTATTCTTCATCTATTATTTCTTATATTAGTTCAAAGCTAG 

ATAATCTGTCTTCTAAGAGTAATGAATCTTTTGAGAAGATAAAATTTTATTCTGAAGATTTGAATGAATATTTGGA 

ACAAATAGAAACTGCTATATCAAATACTGAGAGTATAGATTCTAGCATTTTAGTTTACGAACAACTAAGAGATACT 

TTTTCTAGATTTGAAAAATCAATTGTTGATATTTTAAAAGGCTTTGAATCTATTGCTGATCCGATTAATGATC 

ATAAATATATATCAGAAATCTCTTCAAATTTTGAAGAGAGTGTTAGTTTTTTCTATAGTATAGATAAAAATTTAGA 

AATTTTTAATAAGGTTGCTACTATAAATTCTACTGATATTGAAAATATTAAAAGTAAGGTTTTTGATTTAAATATT 

GTTTTTGAAAATGTGAATAAAAATTTTGCAGATCTTTTGTCTCAAACAAATAGTTTGCAAAGTGTA^ 

TAGTTTCAATTTCAGCTCAGACCAATATGCTTGCTATGAATGCAGCAATTGAAGCAGCAAAAGCAGGTGATGCAGG 

TAAAAGTTTTGCAGTTGTTGCTGAGGAGATTAGAAAGCTTGCTATTAATTCTGGAAAATATTCTAAAACCATTAAA 

GATGAACTTAAAACGGTCGACAGCATTATTGCAGTAATTAATTCAGAGATTGATACAATTTATAAAAATTTCATAG 

ACATTCAAGATAATGTGGACAACAATTTTTCAAGACACGAGAAAGTAGATCTTACTCTTGCTAAGCATTTTAAAGA 

AATTGGCGAGTTTAAAGAAAGGTATTTGTCTCACGATACTAAGATCAGAGATGCTAAGAATATGTATAAAGAAATA 

TTTAATAATCATTATTTTATTAGTGGCAAGTTTAACAACTTTAGTCAAGATTTAAAAGAGTTTAAAGTTTCTAAGA 

TGAATTTAGATGCGGTAAGTTCTCTTCAAGAATATTCATCTTTAGTAAAGTCTTCTAAGGATAAGATATTAAAGAC 

AAAGGAATTGATTCAAAAGATTAATGATGAGATTAAAGATATTCTTTTTTAG 
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MSIDKVPDEAFAEKIVGDGIAILPTSKnSLIJ^PCDGKIGKIFKTNHAFSLETKEGVEIFVHFGINTLNI^ 

AEEGINVKQGEVIIRLDLEYLKEHSESVITPWIANSDEVSSIEYSFGRLENDSEYILSSSTVLTEEIRHKISQTK 

PVIAGKDLVLRVKKZ 

t346.aa 

CrXSKIGKIFKTNHAFSLETKEGVEIFVHFGINTLNLNGKGFTRVAEEGINVKQGEVIIRLDLEYLKEHSESVITPV 
VIANSDEVSSIEYSFGRLENDSEYILSSSTVLTEEIRHKISQTKPVIAGKDLVLRVKKZ 

f346.nt 

ATGTCAATTGATAAGGTTCCCGATGAAGCTTTTGCTGAAAAAATAGTTGGCGATGGAATTGCAATTCTTCCAACAA 
GCAATGAGTTGTTGGCGCCTTGTGATGGGAAAATAGGTAAAATTTTTAAAACCAATCATGCCTTTAGCCTTGAAAC 
TAAAGAGGGCGTTGT^AATTTTTGTCCATTTTGGAATTAATACTCTTAATTTAAATGGTAAGGGTTTTACAAGAGTT 
GCTGAAGAGGGCATTAATGTTAAACAAGGTGAAGTTATTATTAGGCTTGATCTTGAATATTTAAAAGAGCATTCAG 
AATCCGTTATTACTCCGGTTGTTATTGCAAATTCTGATGAAGTTTCAAGTATAGAATATTCTTTTGGAAGGCTTGA 
AAATGATTCTGAATATATTTTATCATCTTCAACTGTCTTGACAGAAGAAATTAGGCATAAAATATCTCAAACAAAG 
CCTGTTATAGCGGGCAAAGATTTGGTGTTGCGAGTTAAAAAGTAA 

t346.nt 

TGTGATGGGAAAATAGGTAAAATTTTTAAAACCAATCATGCCTTTAGCCTTGAAACTAAAGAGGGCGTTGAAATTT 
TTGTCCATTTTGGAATTAATACTCTTAATTTAAATGGTAAGGGTTTTACAAGAGTTGCTGAAGAGGGCATTAATGT 
TAAACAAGGTGAAGTTATTATTAGGCTTGATCTTGAATATTTAAAAGAGCATTCAGAATCCGTTATTACTCCGGTT 
GTTATTGCAAATTCTGATGAAGTTTCAAGTATAGAATATTCTTTTGGAAGGCTTGAAAATGATTCTGAATATATTT 
TATCATCTTCAACTGTCTTGACAGAAGAAATTAGGCATAAAATATCTCAAACAAAGCCTGTTATAGCGGGCAAAGA 
TTTGGTGTTGCGAGTTAAAAAGTAA 

f373.aa 

MNYQRIKNYCKFTSVFLFFLFSCVSNELKLDQSLVKGKLVNGLRYYIYKNQTPKNAVNMGIVFNVGSLNEEDN^ 

lAHYLEHMAFNGTKDYPGNSIVDVLKKFGMQFGADINAATSFDFTYYRLDLSDGNNKDEIDESINILRNWASQISF 

MKEEIDLERNIIIEEKKLGETYPGRIYEKMDKFLTSGSLYEFRSPIGLEEQILSFQPEDFKKFYRKWYRPELASVI 

WGDIDPIEIEEKIKKQFVSWKNPTDKIKEVKVSLDVELKDKFLLLEDLEVGEPSLMFFKKEIINFVKTKDDLLNA 

IKKSLLAALFENRFSELKTAGVKQFKNVSNKDFFSFKSDNNTIVAKSISLNFNPDHLNEGIQDFFYELERIRKFGF 

TQGELEKVRSQFYKSLELRKKNINKTNSWAIFQDLIEIAINGSNKFDMNEYCDLSFQYLEKIDLKTINNLVGREFD 

VKNCAIFYSYHGRAHPVLTLEDIDNLQKIALKRELKPYENSLIEGKFFKKSLDDKDIIRENEFENEISSFVLENGV 

EVYFKYNDQKKGVIDFSATSWGGLINEDLKLIPVLSFAPGWSGSGYGDYSALQIEKYLSDKAVSLRVGVGAQESY 

ISGSSDKKDLETLFQLIYFTFKEPKIDDVSLQNAINNIKALIKSNENSSDYHFHKAISKFLNNNDPRFEDTKDSDL 

QYFTKENILSFYKKRFTYANNFKFVLLETQIFRQZ 

t373.aa 

CVSNELKLDQSLVKGKLVNGLRYYIYKNQTPKNAVNMGIVFNVGSLNEEDNERGIAHYLEHMAFNGTKDYPGNSIV 

DVLKiCFGMQFGADINAATSFDFTYYRLDLSDGNNKDEIDESINILRIJWASQISFMKEEIDLERNIIIEEKKLGETY 

PGRIYEKMDKFLTSGSLYEFRSPIGLEEQILSFQPEDFKKFYRKWYRPELASVIWGDIDPIEIEEKIKKQFVSWK 

NPTDKIKEVKVSLDVELKDKFLLLEDLEVGEPSLMFFKKEIINFVKTKDDLLNAIKKSLIJUVLFENRFSELKTAGV 

KQFKNVSNKDFFSFKSDNNTIVAKSISLNFNPDHIJSrEGIQDFFYELERIRKFGFTQGELEKVRSQFYKSLELRKKN 

INKTNSWAIFQDLIEIAINGSNKFDMNEYCDLSFQYLEKIDLKTlNNLVGREFDVKNCAIFYSYHGRAHPVIiTLED 

IDNLQKIALKRELKPYENSLIEGKFFKKSLDDKDIIRENEFENEISSFVLENGVEVYFKYNDQKKGVIDFSATSWG 

GLINEDLKLIPVLSFAPGWSGSGYGDYSALQIEKYLSDKAVSLRVGVGAQESYISGSSDKKDLETLFQLIYFTFK 

EPKIDDVSLQNAINNIKALIKSNENSSDYHFHKAISKFLNNNDPRFEDTKDSDLQYFTKENILSFYKKRFTYAN^ 

KFVLLETQIFRQ2 




PCT/US98/12718 



£373. nt 



wo 98/59071 

162 

TABLE 1. Nucleotide and Amino Acid Sequences 

ATGAATTATCAAAGAATTAAGAATTATTGTAAATTTACAAGCGTTTTTCTATTTTTTTTGTTTTCCTGTGTTTCTA 
ATGAGTTAAAGTTAGATCAAAGTTTGGTAAAAGGAAAACTTGTCAATGGGCTAAGGTATTATATTTATAAAAATCA 
AACCCCAAAGAATGCCGTTAATATGGGAATTGTTTTTAATGTGGGCTCACTTAATGAAGAAGATAATGAGAGGGGA 
ATAGCGCATTATCTTGAACATATGGCTTTTAATGGTACAAAAGATTATCCAGGGAATTCTATAGTTGATGTTCTTA 
AAAAATTTGGAATGCAATTTGGTGCTGACATTAATGCTGCTACTAGTTTTGATTTCACTTATTATAGACTTGATTT 
GTCAGATGGTAATAATAAAGATGAAATTGATGAATCTATAAATATTTTGAGAAACTGGGCTTCTCAAATCAGTTTC 
ATGAAAGAAGAAATAGATCTAGAGCGAAATATTATTATTGAGGAAAAAAAGCTTGGTGAGACTTATCCTGGAAGAA 
TTTATGAGAAAATGGATAAGTTTTTGACAAGCGGAAGTCTTTATGAATTTAGAAGTCCTATTGGACTTGAAGAGCA 
AATTTTATCTTTTCAGCCAGAAGATTTTAAAAAATTTTATAGAAAGTGGTATAGGCCAGAACTTGCAAGTGTTATT 
GTGGTAGGAGATATTGATCCTATAGAAATTGAAGAGAAGATAAAGAAGCAATTTGTTTCTTGGAAAAATCCAACCG 
ATAAAATTAAAGAAGTAAAAGTAAGTTTAGACGTAGAGCTTAAGGATAAATTTTTACTTTTAGAAGATTTGGAAGT 
TGGAGAGCCTAGTTTAATGTTCTTTAAAAAGGAAATTATTAACTTTGTAAAGACCAAAGATGACCTTTTAAATGCT 
ATTAAAAAGTCTTTATTAGCCGCTCTTTTTGAAAATAGATTTTCTGAATTAAAGACTGCTGGGGTAAAGCAATTTA 
AAAATGTTTCAAATAAAGATTTTTTCTCATTTAAATCAGATAACAATACCATTGTTGCAAAATCGATTTCTTTAAA 
CTTTAATCCAGATCATTTGAACGAAGGAATACAAGACTTTTTTTATGAGCTTGAGAGGATAAGAAAATTTGGATTT 
ACCCAAGGTGAGCTTGAAAAAGTTAGATCTCAATTTTACAAATCTTTAGAATTAAGGAAAAAGAATATAAATAAAA 
CAAATTCATGGGCTATTTTTCAGGATTTAATAGAAATTGCTATTAATGGTTCTAATAAATTTGATATGAATGAATA 
TTGCGATCTTTCTTTTCAATATTTGGAAAAGATTGATTTAAAAACAATAAACAATCTTGTAGGAAGAGAGTTTGAT 
GTAAAAAATTGTGCAATTTTTTATTCTTACCATGGAAGAGCACATCCTGTTTTAACTCTTGAAGATATTGACAATC 
TTCAAAAGATAGCTTTAAAAAGAGAGTTAAAGCCTTATGAGAATTCTTTAATTGAAGGTAAATTTTTTAAGAAGTC 
TTTAGATGATAAAGATATTATTAGAGAAAATGAGTTTGAAAATGAAATTTCGTCATTTGTTCTTGAAAATGGGGTT 
GAAGTTTATTTTAAATATAATGATCAAAAAAAAGGTGTAATTGATTTTAGTGCAACTTCTTGGGGAGGTTTAATTA 
ATGAAGATTTAAAACTTATTCCTGTTTTATCTTTTGCTCCCGGAGTAGTATCTGGTTCGGGTTATGGTGATTATTC 
TGCATTACAGATTGAAAAATATTTATCAGATAAAGCTGTTTCTTTAAGAGTTGGGGTTGGAGCTCAAGAATCATAT 
ATTTCTGGAAGTTCAGATAAAAAAGATCTTGAAACTCTTTTTCAGCTTATATATTTTACTTTTAAGGAACCCAAAA 
TTGATGATGTTTCTTTGCAAAATGCTATTAATAATATAAAAGCATTAATAAAGAGCAATGAAAATAGTTCTGATTA 
TCATTTTCATAAAGCCATTAGTAAATTTTTAAACAATAATGATCCTAGATTTGAAGATACAAAAGATAGTGATTTG 
CAATATTTTACAAAAGAAAATATTTTGTCTTTTTATAAGAAAAGGTTTACTTATGCAAATAATTTTAAGTTTGTCT 
TGCTGGAGACTCAGATATTCAGACAATAA 

t373 .nt 

TGTGTTTCTAATGAGTTAAAGTTAGATCAAAGTTTGGTAAAAGGAAAACTTGTCAATGGGCTAAGGTATTATATTT 

ATAAAAATCAAACCCCAAAGAATGCCGTTAATATGGGAATTGTTTTTAATGTGGGCTCACTTAATGAAGAAGATAA 

TGAGAGGGGAATAGCGCATTATCTTGAACATATGGCTTTTAATGGTACAAAAGATTATCCAGGGAATTCTATAGTT 

GATGTTCTTAAAAAATTTGGAATGCAATTTGGTGCTGACATTAATGCTGCTACTAGTTTTGATTTCACTTATTATA 

GACTTGATTTGTCAGATGGTAATAATAAAGATGAAATTGATGAATCTATAAATATTTTGAGAAACTGGGCTTCTCA 

AATCAGTTTCATGAAAGAAGAAATAGATCTAGAGCGAAATATTATTATTGAGGAAAAAAAGCTTGGTGAGACTTAT 

CCTGGAAGAATTTATGAGAAAATGGATAAGTTTTTGACAAGCGGAAGTCTTTATGAATTTAGAAGTCCTATTGGAC 

TTGAAGAGCAAATTTTATCTTTTCAGCCAGAAGATTTTAAAAAATTTTATAGAAAGTGGTATAGGCCAGAACOT 

AAGTGTTATTGTGGTAGGAGATATTGATCCTATAGAAATTGAAGAGAAGATAAAGAAGCAATTTGTTTCTTGGAAA 

AATCCAACCGATAAAATTAAAGAAGTAAAAGTAAGTTTAGACGTAGAGCTTAAGGATAAATTTTTACTTTTAGAAG 

ATTTGGAAGTTGGAGAGCCTAGTTTAATGTTCTTTAAAAAGGAAATTATTAACTTTGTAAAGACCAAAGATG 

TTTAAATGCTATTAAAAAGTCTTTATTAGCCGCTCTTTTTGAAAATAGATTTTCTGAATTAAAGACTGCT^ 

AAGCAATTTAAAAATGTTTCAAATAAAGATTTTTTCTCATTTAAATCAGATAACAATACCATTGTTGCAAAATCGA 

TTTCTTTAAACTTTAATCCAGATCATTTGAACGAAGGAATACAAGACTTTTTTTATGAGCTTGAGAGGATAAGAAA 

ATTTGGATTTACCCAAGGTGAGCTTGAAAAAGTTAGATCTCAATTTTACAAATCTTTAGAATTAAGGAAAAAGAA 

ATAAATAAAACAAATTCATGGGCTATTTTTCAGGATTTAATAGAAATTGCTATTAATGGTTCTAATAAATTTGATA 

TGAATGAATATTGCGATCTTTCTTTTCAATATTTGGAAAAGATTGATTTAAAAACAATAAACAATCTTGTAGGAAG 

AGAGTTTGATGTAAAAAATTGTGCAATTTTTTATTCTTACCATGGAAGAGCACATCCTGTTTTAACTCTTC 

ATTGACAATCTTCAAAAGATAGCTTTAAAAAGAGAGTTAAAGCCTTATGAGAATTCTTTAATTC 

TTAAGAAGTCTTTAGATGATAAAGATATTATTAGAGAAAATGAGTrrcAAAATGAAATTTCGTCATTTGTTC 

AAATGGGGTTGAAGTTTATTTTAAATATAATGATCAAAAAAAAGGTGTAATTGATTTTAGTGCAACT 

GGTTTAATTAATGAAGATTTAAAACTTATTCCTGTTTTATCTTTTGCTCCCGGAGTAGTATCT^ 

GTGATTATTCTGCATTACAGATTGAAAAATATTTATCAGATAAAGCTGTTTCTTTAAGAGTTGGGGTTGGAGCT 

AGAATCATATATTTCTGGAAGTTCAGATAAAAAAGATCTTGAAACTCTTTTTCAGCTTATATATTTTACTTT^ 
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GAACCCAAAATTGATGATGTTTCTTTGCAAAATGCTATTAATAATATAAAAGCATTAATAAAGAGCAATGAAAAT^ 
GTTCTGATTATCATTTTCATAAAGCCATTAGTAAATTTTTAAACAATAATGATCCTAGATTTGAAGATACAAAAGA 
TAGTGATTTGCAATATTTTACAAAAGAAAATATTTTGTCTTTTTATAAGAAAAGGTTTACTTATGCAAAT^ 
AAGTTTGTCTTGCTGGAGACTCAGATATTCAGACAATAA 

f384.aa 

MDWDFEKIIFLLNESTRLALSGCAKLILDFKSDGSIVTQVDKQIEQFLFKEIKKPGNFVLGEETISTYKEEYIKDA 
LISESTFIIDPirXSTSSFAAGLPSYGISIAYASGGKIIEGAISLPLSGEFFITSKDNVFYAKKNIGSYPLKKDFNK 
FIFDNSKCYNIHSLLAVSRSIIRLFNLDISSHIHINGSCVYSFAKLFTGSYKAYFSFVGLWDIAACLAIGNKLGMV 
GEFYCGNKMTLDILDSMYILEPNNHKRWSLKDFFIYSDNKSTIDIIRKDANKKINK 

t384 .aa 

CAKLILDFKSDGSIVTQVDKQIEQFLFKEIKKPGNFVLGEETISTYKEEYIKDALISESTFIIDPIDGTSSFAAGL 
PSYGISLAYASGGKIIEGAISLPLSGEFFITSKDNVFYAKKNIGSYPLKKDFNKFIFDNSKCYNIHSLLAVSRSII 
RliFNLDISSHIHINGSCVYSFAKLFTGSYKAYFSFVGLWDIAACLAIGNKLGMVGEFYCGNKMTLDILDSMYILEP 
NNHKRWSLKDFFIYSDNKSTIDIIRKDANKKINKZ 

£384. nt 

ATGGATTGGGATTTTGAAAAAATTATATTTTTATTAAATGAATCAACTAGGCTTGCATTAAGTGGTTGTGCTAAAT 
TAATTTTAGATTTTAAATCTGATGGGTCTATTGTAACTCAGGTTGATAAGCAAATTGAGCAATTCTTATTCAAAGA 
GATCAAAAAGCCTGGAAATTTTGTTCTTGGAGAAGAGACAATATCTACTTATAAAGAAGAGTATATCAAAGATGCT 
TTAATATCAGAGAGTACTTTTATTATTGATCCTATTGATGGAACTTCTTCTTTTGCAGCAGGCCTTCCTTCATATG 
GAATATCGCTAGCGTATGCTAGTGGCGGCAAAATTATTGAAGGAGCCATTTCTCTTCCTTTAAGCGGAGAGTTTTT 
TATTACTTCTAAAGATAATGTATTTTATGCTAAAAAAAACATTGGTAGCTATCCTTTAAAAAAGGATTTTAATAAA 
TTTATTTTTGATAATTCTAAATGTTACAATATTCATAGTTTACTTGCAGTTTCAAGGTCTATTATAAGGTTATTTA 
ATCTTGATATTTCTTCTCATATTCATATTAATGGTTCTTGTGTATATTCTTTTGCTAAACTTTTTACAGGTTCTTA 
TAAGGCCTACTTTTCTTTTGTAGGACTTTGGGATATTGCAGCGTGTTTAGCTATTGGTAATAAATTGGGCATGGTT 
GGCGAATTTTATTGTGGTAATAAAATGACATTAGATATCTTAGATTCAATGTATATTTTAGAGCCTAATAATCATA 
AAAGATGGTCCTTGAAAGATTTTTTTATTTATTCTGATAATAAATCAACAATAGACATTATAAGAAAAGATGCAAA 
TAAAAAAATCAATAAGTAA 

t384 .nt 

AGTGGTTGTGCTAAATTAATTTTAGATTTTAAATCTGATGGGTCTATTGTAACTCAGGTTGATAAGCAAATTGA^ 

AATTCTTATTCAAAGAGATCAAAAAGCCTGGAAATTTTGTTCTTGGAGAAGAGACAATATCTACTTATAAAGAAGA 

GTATATCAAAGATGCTTTAATATCAGAGAGTACTTTTATTATTGATCCTATTGATGGAACTTCTTCTTTTGCAGCA 

GGCCTTCCTTCATATGGAATATCGCTAGCGTATGCTAGTGGCGGCAAAATTATTGAAGGAGCCATTTCTCTTCCTT 

TAAGCGGAGAGTTTTTTATTACTTCTAAAGATAATGTATTTTATGCTAAAAAAAACATTGGTAGCTATCCTTTAAA 

AAAGGATTTTAATAAATTTATTTTTGATAATTCTAAATGTTACAATATTCATAGTTTACTTGCAGTTTCAAGGTCT 

ATTATAAGGTTATTTAATCTTGATATTTCTTCTCATATTCATATTAATGGTTCTTGTGTATATTCTTTTGCTAAAC 

TTTTTACAGGTTCTTATAAGGCCTACTTTTCTTTTGTAGGACTTTGGGATATTGCAGCGTGTTTAGCTATTC^ 

TAAATTGGGCATGGTTGGCGAATTTTATTGTGGTAATAAAATGACATTAGATATCTTAGATTCAATGTATATTT^ 

GAGCCTAATAATCATAAAAGATGGTCCTTGAAAGATTTTTTTATTTATTCTGATAATAAATCAACAATAGACATTA 

TAAGAAAAGATGCAAATAAAAAAATCAATAAGTAA 

f 860 .aa 

MAFYKLNDNIALAEDLLKYLLSSILNECSQDMDFLEITYIEKGLIKKLEWINSNFEVITY 

KPYWGIDLQTDHERYLTEETFKKPWVIDYPKNFKAFYMKANK1)NKTVKGMDILVPKIGEIIGGSEREDDLQKLEN 
RIKELNIJ^IEHLNWYLDLRRFGSAPHSGFGLGLERLVQYSTGISNIRDSIPFPRTPKNLYFZ 
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CSQDMDFLENYIEKGLIKKLEWINSNFEVITYTKAIEILENSKKNFEIKPYWGIDLQTDHERYLTEETFKKPVVV 

IDYPKNFKAFYMKANKDNKTVKGiyroiLVPKIGEIIGGSEREDDLQKLENRIKELNLNIEHLNV^ 

GFGLGLERLVQYSTGISNIRDSIPFPRTPKNLYFZ 

f 860 .nt 

ATGGCTTTTTATAAGCTTAACGACAATATTGCCCTAGCAGAAGATCTCTTGAAATATCTTTTAAGTTCAATTTTAA 

ACGAATGCTCACAAGATATGGATTTTTTAGAAAATTACATTGAAAAAGGTTTAATTAAAAAACTAGAAAATGTAAT 

AAATTCAAATTTTGAGGTTATTACCTATACTAAAGCAATTGAAATTCTTGT^AAACTCAAAAAAAAATTTTGAAATA 

AAACCTTACTGGGGAATAGATTTGCAAACAGATCACGAAAGATACCTAACAGAAGAGACTTTTAAAAAACCGGTAG 

TGGTCATTGATTATCCAAAAAATTTCAAAGCATTTTACATGAAAGCAAATAAAGACAATAAAACTGTTAAAGGA^ 

GGACATACTTGTTCCAAAAATTGGAGAGATTATAGGGGGAAGCGAAAGAGAAGATGACCTTCAAAAATTAGAAAAT 

AGAATAAAAGAATTAAACTTAAACATTGAACATCTAAACTGGTATCTTGATCTAAGAAGATTTGGCTCGGCTCCTC 

ATTCTGGCTTTGGACTTGGACTTGAAAGATTGGTGCAATACTCAACAGGAATATCTAATATAAGAGATTCAATACC 

ATTCCCAAGGACTCCTAAAAATCTTTATTTTTAA 

t860 .nt 

TGCTCACAAGATATGGATTTTTTAGAAAATTACATTGAAAAAGGTTTAATTAAAAAACTAGAAAATGTAATAAATT 
CAAATTTTGAGGTTATTACCTATACTAAAGCAATTGAAATTCTTGAAAACTCAAAAAAAAATTTTGAAATAAAACC 
TTACTGGGGAATAGATTTGCAAACAGATCACGAAAGATACCTAACAGAAGAGACTTTTAAAAAACCGGTAGTGGTC 
ATTGATTATCCAAAAAATTTCAAAGCATTTTACATGAAAGCAAATAAAGACAATAAAACTGTTAAAGGAATGGACA 
TACTTGTTCCAAAAATTGGAGAGATTATAGGGGGAAGCGAAAGAGAAGATGACCTTCAAAAATTAGAAAATAGAAT 
AAAAGAATTAAACTTAAACATTGAACATCTi\AACTGGTATCTTGATCTAAGAAGATTTGGCTCGGCTCCTCATTCT 
GGCTTTGGACTTGGACTTGAAAGATTGGTGCAATACTCAACAGGAATATCTAATATAAGAGATTCAATACCATTCC 
CAAGGACTCCTAAAAATCTTTATTTTTAA 

f446.aa 

MKILRLCLLFLFFACTFDYDEYSSRSDVAKKFPSIQILGIKYYDWYNKEQTVLNSLSFSYFNDYKIYKAENGRFL 

YHSLDNEISGKFl^LEGSYITKDLDMRDSVEFKIEDKlSnsrYYLLNSNRLLWKNKDK^ 

KGFSYFLKSNVFYFDSGVEGIMNZ 

t446 .aa 

CTFDYDEYSSRSDVAKKFPSIQILGIKYYDVVYNKEQTVLNSLSFSYFNDYKIYKAENGRFLYHSLDNEISGKFNN 

LEGSYITKDLDMRDSVEFKIEDKl^nSTYYLLNSNRLLWKNKDKKLQSPPNELVLIRFNDSKINGKGFSYFLK 

DSGVEGIMNZ 

f446.nt 

ATGAAAATACTTAGACTTTGTTTGTTGTTTTTGTTTTTTGCTTGTACTTTTGATTATGATGAGTATTCTAGTAGAT 

CTGATGTGGCCAAAAAGTTTCCTTCAATACAAATATTAGGAATCAAGTATTATGATGTTGTATACAATAAAGAGCA 

AACCGTTTTAAATTCTTTAAGCTTTAGTTATTTCAATGACTATAAAATTTATAAGGCAGAGAATGGAAGGTTTTTA 

TATCATTCCCTAGATAATGAAATTTCAGGGAAGTTTAATAATTTGGAAGGTTCTTATATTACAAAGGATTTGGATA 

TGAGAGATTCTGTAGAATTTAAAATAGAAGATAAAAATAATTATTATTTGCTTAATTCAAATAGGCTTTTATC 

GAATAAAGACAAGAAGTTGCAATCCCCCCCAAATGAGCTAGTATTAATTAGATTTAATGATAGCAAAATAAACGGA 

AAAGGATTTTCTTATTTTTTAAAGAGCAATGTTTTTTATTTTGATTCTGGAGTTGAAGGAATCATGAATTGA 

t446.nt 

TCTACTTTTGATTATGATGAGTATTCTAGTAGATCTGATGTGGCCAAAAAGTTTCCTTCA^ 

TCAAGTATTATGATGTTGTATACAATAAAGAGCAAACCGTTTTAAATTCTTTAAGCTTTAGT^ 

TAAAATTTATAAGGCAGAGAATGGAAGGTTTTTATATCATTCCCTAGATAATGAAATTTCAGGGAAGTT^ 

TTCGAAGGTTCTTATATTACAAAGGATTTGGATATGAGAGATTCTGTAGAATTTAAAATAGAAG 

ATTATTTGCTTAATTCAAATAGGCTTTTATGGAAGAATAAAGACAAGAAGTTGCAATCCCCCCCAAATGAGCTAGT 
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ATTAATTAGATTTAATGATAGCAAAATAAACGGAAAAGGATTTTCTTATTTTTTAAAGAGCAATGTTTTTTATTTT 
GATTCTGGAGTTGAAGGAATCATGAATTGA 

f457.aa 

MKQKLSWILLFCFLSCRSESRLAENVLIEFFDSIKNFQSSPEIFFNYLNIPSDDDLKAKIRGLKSQAKDDFIFYPL 
FFNNLRYEIIGRKNISKGFEFEWIKNINFQNGIEKFLAKLNKIEGRSLNIKNLEKKERKKIFDNLINEVIGELDD 
FDYTEWHFFRWKS S S ES YK I ELLGDVLNI QS RNKL INDLFL VLS PG I Z 

t457.aa 

CFLSCRSESRLAENVLIEFFDSIKNFQSSPEIFFNYLNIPSDDDLKAKIRGLKSQAKDDFIFYPLFFNNLRYEIIG 
RKNISKGFEFEVVIKNINFQNGIEKFLAKLNKIEGRSLNIKNLEKKERKKIFDNLINEVIGELDDFDYTEVVHFFR 
WKSSSESYKIELLGDVLNIQSRNKLINDLFLVLSPGIZ 

f457.nt 

ATGAAGCAAAAATTAAGTTGGATTTTATTATTTTGTTTTTTGTCTTGTAGATCTGAATCTAGATTGGCTGAAAATG 

TTTTAATAGAGTTTTTTGATTCTATTAAAAATTTTCAAAGCAGTCCTGAAATATTTTTTAATTATTTAAATATTCC 

AAGTGATGATGATCTGAAGGCAAAAATTCGTGGGTTGAAATCTCAGGCAAAGGATGATTTCATTTTTTATCCTTTG 

TTTTTTAATAATCTAAGATATGAGATAATAGGTAGAAAAAATATTTCTAAGGGCTTTGAATTTGAAGTTGTTATTA 

AAAATATTAACTTTCAAAACGGTATAGAAAAATTTTTGGCTAAATTAAATAAAATTGAAGGGAGATCTTTA^ 

TAAAAATTTAGAAAAAAAAGAGCGTAAAAAAATATTTGACAATTTAATAAATGAAGTTATTGGAGAGTTGGATGAT 

TTTGATTACACTGAAGTTGTTCATTTTTTTAGAGTAGTTAAGAGTTCTTCTGAAAGTTATAAAATAGAGCTTTTAG 

GAGATGTTTTAAATATACAGTCTAGAAATAAGCTTATTAATGATCTTTTTTTGGTTTTATCGCCTGGAATTTAA 

t457 .nt 

TGTTTTTTGTCTTGTAGATCTGAATCTAGATTGGCTGAAAATGTTTTAATAGAGTTTTTTGATTCTATTAAAAATT 
TTCAAAGCAGTCCTGAAATATTTTTTAATTATTTAAATATTCCAAGTGATGATGATCTGAAGGCAAAAATTCGTGG 
GTTGAAATCTCAGGCAAAGGATGATTTCATTTTTTATCCTTTGTTTTTTAATAATCTAAGATATGAGATAATAGGT 
AGAAAAAATATTTCTAAGGGCTTTGAATTTGAAGTTGTTATTAAAAATATTAACTTTCAAAACGGTATAGAAAAAT 
TTTTGGCTAAATTAT^TAAAATTGAAGGGAGATCTTTAAATATTAAAAATTTAGAAAAAAAAGAGCGTAAAAAAAT 
ATTTGACAATTTAATAAATGAAGTTATTGGAGAGTTGGATGATTTTGATTACACTGAAGTTGTTCATTTTTTTAGA 
GTAGTTAAGAGTTCTTCTGAAAGTTATAAAATAGAGCTTTTAGGAGATGTTTTAAATATACAGTCTAGAAATAAGC 
TTATTAATGATCTTTTTTTGGTTTTATCGCCTGGAATTTAA 

f542.aa 

MRIVIFIFGILLTSCFSRNGIESSSKKIKISMLVDGVLDDKSFNSSANEALLRLKKDFPENIEEVFSCAISGVYSS 
YVSDLDNLKRNGSDLIWLVGYMLTDASLLVSSENPKISYGIIDPIYGDDVQIPENLIAWFRVEPRCFFGWLYCSQ 
KKLFWQNRFYRGNEGZ 

t:542 .aa 

CFSRNGIESSSKKIKISMLVDGVLDDKSFNSSANEALLRLKKDFPENIEEVFSCAISGVYSSYVSDLDNLKRNGSD 
LIWIiVGYMLTDASLLVSSENPKISYGIIDPIYGDDVQIPENLIAWFRVEPRCFFGWLYCSQKKLFWQNRFYRGNE 
GZ 

f542.nt 

ATGAGAATTGTAATTTTTATATTCGGTATTTTGTTGACTTCTTGCTTTAGTAGAAATGGAATAGAATCTAGTTCAA 

AAAAAATTAAGATATCCATGTTGGTAGATGGTGTTCTTGACGACAAATCTTTTAATTCTAGTGCTAATGAGG^^ 

ATTACGCTTGAAAAAAGATTTTCCAGAAAATATTCAAGAAGTTTTTTCTTC 

TATGTTTCAGATCTTGATAATTTAAAAAGGAATGGCTCAGACTTGATTTGGCTTGTAG^ 

CATCTTTATTGGTTTCATCGGAGAATCCAAAAATTAGCTATGGAATAATAGATCCCAT^ 
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GATTCCTGAAAACTTGATTGCTGTTGTTTTCAGAGTAGAGCCAAGGTGCTTTTTTGGCTGGCTATATTO 
AAAAAGCTTTTCTGGCAAAATAGGTTTTATAGGGGGAATGAAGGGTAA 

t542.nt 

TGCTTTAGTAGAAATGGAATAGAATCTAGTTCAAAAAAAATTAAGATATCCATGTTGGTAGATGGTGTTCTTGACG 
ACAAATCTTTTAATTCTAGTGCTAATGAGGCTTTATTACGCTTGAAAAAAGATTTTCCAGAAAATATTGAAGAAGT 
TTTTTCTTGTGCTATTTCTGGAGTTTATTCTAGTTATGTTTCAGATCTTGATAATTTAAAAAGGAATGGCTCAGAC 
TTGATTTGGCTTGTAGGGTACATGCTTACGGACGCATCTTTATTGGTTTCATCGGAGAATCCAAAAATTAGCTATG 
GAATAATAGATCCCATTTATGGTGATGATGTTCAGATTCCTGAAAACTTGATTGCTGTTGTTTTCAGAGTAGAGCC 
AAGGTGCTTTTTTGGCTGGCTATATTGCAGCCAAAAAAAGCTTTTCTGGCAAAATAGGTTTTATAGGGGGAATGAA 
GGGTAA 

f93 .aa 

MKRILAMHDISSMGRTSLTICIPVISSFNMQVCPFVTAVLSASTAYKKFEIVDLTDHLEKFINIWKEQNEHFDILY 
TGFLGSEKQQITIEKIIKLIKFEKIVIDPVFADDGEIYPIFDNKIISGFRKIIKYANIITPNITELEMLSKSSKLN 
NKDDIIKAILNLDTKATVVVTSVKRGNLLGNICYNPKNKEYSEFFLEGLEQNFSGTGDLFTSLLIGYLEKFETEQA 
LEKTTKAIHLIIKESIKENVSKKEGVRIENFLKNTFZ 

t93 .aa 

CIPVISSFNMQVCPFVTAVLSASTAYKKFEIVDLTDHLEKFINIWKEQNEHFDILYTGFLGSEKQQITIEKIIKLI 
KFEKIVIDPVFADDGEIYPIFDNKIISGFRKIIKYANIITPNITELEMLSKSSKLNNKDDIIKAILNLDTKATVW 
TSVKRGNLLGNICYNPKNKEYSEFFLEGLEQNFSGTGDLFTSLLIGYLEKFETEQALEKTTKAIHLIIKESIKENV 
SKKEGVRIENFLKNTFZ 

f93 .nt 

ATGAAAAGAATTTTAGCAATGCATGATATTTCAAGCATGGGAAGAACATCTCTTACAATATGCATACCAGTAATAT 

CTTCGTTTAATATGCAAGTTTGTCCTTTTGTGACAGCTGTCCTTTCTGCTTCCACAGCTTATAAAAAATTTGAAAT 

AGTGGATTTAACCGATCATTTAGAAAAATTTATCAATATATGGAAAGAACAAAATGAGCACTTTGACATACTCTAT 

ACCGGATTTCTGGGAAGCGAAAAACAACAAATAACAATAGAGAAAATAATTAAATTAATAAAATTTGAAAAAATTG 

TAATTGATCCTGTGTTTGCTGACGATGGAGAAATTTACCCTATATTTGATAATAAAATAATTAGTGGATTTAGAAA 

AATCATAAAGTACGCAAACATAATAACACCCAATATCACAGAACTTGAAATGCTAAGCAAAAGCTCAAAACTTAAC 

AACAAAGATGATATCATAAAAGCAATATTAAATCTTGATACAAAAGCGACGGTAGTTGTTACAAGCGTTAAAAGGG 

GAAATCTCTTGGGAAACATTTGCTACAATCCTAAAAACAAAGAATACTCGGAGTTTTTTTTAGAAGGATTAGAACA 

AAATTTCAGTGGAACAGGAGATTTATTTACCAGCTTACTTATAGGATATTTGGAAAAATTTGAAACAGAGCAAGCC 

TTAGA/WWVACAACAAAGGCTATTCACCTAATAATAAAAGAGTCAATTAAAGAAAATGTTTCAAAAAAAGAAGG 

TCCGAATTGAAAATTTCTTAAA7VAATACATTTTGA 

t93.nt 

tgcataccagtaatatcttcgtttaatatgcaagtttgtccttttgtgacagctgtcctttctgcttccacagctt 
ataaaaaatttgaaatagtggatttaaccgatcatttagaaaaatttatcaatatatggaaagaacaaaatgagca 
ctttgacatactctataccggatttctgggaagcgaaaaacaacaaataacaatagagaaaataattaaatt;^ 
aaatttgaaaaaattgtaattgatcctgtgtttgctgacgatggagaaatttaccctatatttgat^ 

TTAGTGGATTTAGAAAAATCATAAAGTACGCAAACATAATAACACCCAATATCACAGAACTTGAAATGCTAAGCA^ 

aagctcaaaacttaacaacaaagatgatatcataaaagcaatattaaatcttgatacaaaagcgacggtagttgtt 
acaagcgttaaaaggggaaatctcttgggaaacatttgctacaatcctaaaaacaaagaatactcggagttttttt 
tagaaggattagaacaaaatttcagtggaacaggagatttatttaccagcttacttataggatatttggaaaaatt 
tgaaacagagcaagccttagaaaaaacaacaaaggctattcacctaataataaaagagtcaattaaagaaaatgtt 
tcaaaaaaagaaggggtccgaattgaaaatttcttaaaaaatacattttga 
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MGLYLKLLRQSINLKSLFPLSVLFFSCNVVDTDFSV1.EFKVANFNLNDDFSQGLLDSAYNII^ 
KNVLDLIlSn^VLFRAFKNAYFIIX5GSGLSVSILSKRKINIKVLSVMQDSCDLKLGL^ 

DFIKSIANLQISEQILYLKAQMDKLMFILDESEFVIFDLLIKNGFFSLINDSNYTSMLANKIDFRVFSNFFARVSL 
YSFMFVIADYLHSNYWENFPQKIVINZ 

tl05 .aa 

CNVVDTDFSVLEFKVANFNLNDDFSQGLLDSAYNII^RSFDLIIIKNLKNKNVLDLIN^ 

GLSVSILSKRKINIKVLSVMQDSCDLKLGLLVDFKFENNHYGIVIYNLSKDFIKSIANLQISEQILYLKAQMDKLM 
FILDESEFVIFDLLIKNGFFSLINDSNYTSMLANKIDFRWSNFFARVSLYSFMFVIADYLHSNYWENFPQKIVI 
NZ 

flOS.nt 

ATGGGCTTGTATTTGAAGTTGTTGAGACAAAGTATCAACTTGAAGAGTTTATTTCCGCTTAGTGTTTTATTTTTTT 
CCTGTAATGTTGTAGATACAGATTTTAGTGTTTTGGAGTTTAAGGTTGCAAATTTTAATTTAAATGATGATTTTTC 
TCAAGGGTTACTTGATTCTGCTTATAATATTCTAAATCGAAGTTTTGATTTAATAATTATTAAGAATCTTAAGAAT 
AAAAATGTTCTTGATTTAATTAATAATAGAGTTTTATTTAGAGCTTTTAAGAATGCTTATTTTATTGATCAAGGTA 
GTGGCCTTTCTGTTAGCATTCTTTCTAAGCGCAAAATAAATATTAAAGTTTTAAGTGTAATGCAAGATTCTTGCGA 
TTTAAAATTAGGATTGCTTGTGGATTTTAAATTTGAGAATAATCACTATGGTATTGTTATTTATAATTTAAGCAAG 
GATTTTATTAAAAGTATTGCCAATTTGCAAATTAGTGAACAAATTTTATATTTAAAAGCCCAAATGGATAAATTGA 
TGTTTATTTTAGATGAATCTGAATTTGTTATTTTTGATTTATTAATCAAAAATGGATTTTTTAGCTTAATAAATGA 
TTCAAACTACACTTCAATGTTAGCAAATA/yUVTTGATTTTAGAGTTTTTTCTAATTTTTTTGCTAGGGTTTCTTTA 
TATTCATTTATGTTTGTAATTGCAGATTATTTGCATAGCAATTATGTTGTTGAGAATTTTCCTCAAAAAATAGTTA 
TCAATTGA 

tl05 .nt 

TGTAATGTTGTAGATACAGATTTTAGTGTTTTGGAGTTTAAGGTTGCAAATTTTAATTTAAATGATGATTTTTCTC 

AAGGGTTACTTGATTCTGCTTATAATATTCTAAATCGAAGTTTTGATTTAATAATTATTAAGAATCTTAAGAATAA 

AAATGTTCTTGATTTAATTAATAATAGAGTTTTATTTAGAGCTTTTAAGAATGCTTATTTTATTGATCAAGGTAGT 

GGCCTTTCTGTTAGCATTCTTTCTAAGCGCAAAATAAATATTAAAGTTTTAAGTGTAATGCAAGATTCTTGCGATT 

TAAAATTAGGATTGCTTGTGGATTTTAAATTTGAGAATAATCACTATGGTATTGTTATTTATAATTTAAGCAAGGA 

TTTTATTAAAAGTATTGCCAATTTGCAAATTAGTGAACAAATTTTATATTTAAAAGCCCAAATGGATAAATTGATG 

TTTATTTTAGATGAATCTGAATTTGTTATTTTTGATTTATTAATCAAAAATGGATTTTTTAGCTTAATAA^ 

CAAACTACACTTCAATGTTAGCAAATAAAATTGATTTTAGAGTTTTTTCTAATTTTTTTGCTAGGGTTTCTTTATA 

TTCATTTATGTTTGTAATTGCAGATTATTTGCATAGCAATTATGTTGTTGAGAATTTTCCTCAAAAAATAGTTATC 

AATTGA 

flSO.aa 

MKTFVIIGLSNLGIHLLEDLSRLDCQIIIIDTSKELIEEYDVISTESFWEQFTKNALKRIIPVDTDAWIDFDDD 
LGKSALWHYCNLIiGLKEICVKTENRDDAEILKTIiGATKIIFPSKDAARRLTPLLVSPNLSTYNIIGYDIIVAETV 
IPKEYVGKTLFEADLRRECGITVIAVRNLSNSRYEFVDGDYFFLKDDKIVICGKPDSIENFTNNKDLIKDLISGSK 
EDENLNKDAEKKSRFLG I FNFMK I FQKDRKDNZ 

tl50 .aa 

CQIIIIOTSKELIEEYDVISTESFWEQFTKNALKRIIPVDTDAWIDFDDDIXSKSALWHYCNLLGLKEICVKTE 
NRDDAEILKTLGATKIIFPSKDAARRLTPLLVSPNLSTYNIIGYDIIVAETVIPKEYVGKTLFEADLRRECGITVI 
AVRNLSNSRYEFVDGDYFFLKDDKIVICGKPDSIENFTNNKDLIKDLISGSKEDENLNKDAEKKSRFLGIFNFMKI 
FQKDRKDNZ 
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ATGAAAACATTTGTTATTATTGGACTTAGTAATTTAGGCATTCACTTACTTGAAGATTTAAGCAGGCTTGATTGTC 
AAATTATTATTATAGATACATCTAAAGAGCTTATTGAAGAATATGATGTGATATCTACAGAAAGCTTTGTTGTTGA 
GCAATTCACTAAAAATGCTTTGAAAAGAATAATTCCAGTAGATACAGACGCTGTTGTTATTGATTTTGATGATGAT 
CTTGGCAAAAGTGCTCTTGTTACTCACTATTGTAATCTTTTAGGTTTGAAAGAAATATGCGTTAAGACAGAAAATA 
GAGATGATGCTGAAATCTTAAAAACTCTTGGGGCAACAAAAATTATATTTCCAAGTAAAGATGCTGCAAGAAGATT 
AACTCCATTATTAGTATCTCCAAATCTTTCAACTTATAATATTATTGGGTATGATATTATTGTTGCTGAAACTGTT 
ATTCCCAAAGAATATGTTGGTAAAACTCTTTTTGAAGCCGATCTTAGAAGAGAATGTGGGATTACAGTTATTGCTG 
TTAGAAATTTAAGTAATTCTAGGTATGAATTTGTTGATGGCGATTATTTTTTTTTAAAAGATC 

TTGTGGTAAACCAGATAGCATTGAAAATTTTACAAATAATT^AAGATTTAATTAAAGATTTAATTTCAGGCTCTAAA 
GAGGATGAAAATTTAAATAAAGATGCTGAGAAAAAATCTAGATTTTTAGGGATTTTCAATTTTATGAAAATTTTTC 
AAAAAGATCGTAAGGATAATTAG 

tlBO.nt 

TGTCAAATTATTATTATAGATACATCTAAAGAGCTTATTGAAGAATATGATGTGATATCTACAGAAAGCTTTGTTG 
TTGAGCAATTCACTAAAAATGCTTTGAAAAGAATAATTCCAGTAGATACAGACGCTGTTGTTATTGATTTTGATGA 
TGATCTTGGCAAAAGTGCTCTTGTTACTCACTATTGTAATCTTTTAGGTTTGAAAGAAATATGCGTTAAGACAGAA 
AATAGAGATGATGCTGAAATCTTAAAAACTCTTGGGGCAACAAAAATTATATTTCCAAGTAAAGATGCTGCAAGAA 
GATTAACTCCATTATTAGTATCTCCAAATCTTTCAACTTATAATATTATTGGGTATGATATTATTGTTGCTGAAAC 
TGTTATTCCCAAAGAATATGTTGGTAAAACTCTTTTTGAAGCCGATCTTAGAAGAGAATGTGGGATTACAGTTATT 
GCTGTTAGAAATTTAAGTAATTCTAGGTATGAATTTGTTGATGGCGATTATTTTTTTTTAAAAGATGATAAAATTG 
TAATTTGTGGTAAACCAGATAGCATTGAAAATTTTACAAATAATAAAGATTTAATTAAAGATTTAATTTCAGGCTC 
TAAAGAGGATGAAAATTTAAATAAAGATGCTGAGAAAAAATCTAGATTTTTAGGGATTTTCAATTTTATGAAAATT 
TTTCAAAAAGATCGTAAGGATAATTAG 

£219. aa 

MLIARIMNINTLFYGMIIIIFALISClOTKNIQyDKRIKKFLDKNKIEYKIDSENDFIAFKDINlSnsrEKEEVIIR^ 

NSYKNSKIREIFGIVKVFDINTPKIKEISDSLMSDSYNNRVFGSWEIIHNAERGINSLVYIVKAEEFANDTFLLDA 

IDEIASTISIFKKIITTNNENIDNNEENNNTNESNEQPTLKQEKTNSTKESNNELKEDQIEEELQEIKAQZ 

t219.aa 

CNHKNIQYDKRIKKFLDKNKIEYKIDSENDFIAFKDINNNEKEEVIIRSRLNSYKNSKIREIFGIVKVFDINTPKI 

KEISDSIJ^SDSYNlNmVFGSWEIIHNAERGINSLWIVKAEEFANDTFLLDAIDEIASTISIFKKIIT^ 

EENNNTNESNEQPTLKQEKTNSTKESNNELKEDQIEEELQEIKAQZ 

£219. nt 

ATGCTAATTGCAAGAATAATGAATATTAATACATTATTCTACGGCATGATCATTATCATTTTTGCACTCATTTCTT 

GCAATCATAAGAATATACAGTACGACAAGAGAATTAAAAAATTTTTAGATAAAAACAAAATTGAATATAAAATAGA 

CTCAGAAAATGACTTTATAGCATTTAAAGATATAAACAATAACGAAAAAGAAGAAGTAATCATCAGATCAAGACTA 

AACTCATATAAAAATTCAAAGATAAGAGAAATATTTGGAATTGTTAAAGTATTTGATATAAACACACCAAAAATAA 

AAGAAATATCTGACTCGCTTATGAGCGATAGTTATAAT7U\CAGAGTATTTGGATCGTGGGAGATTATTCATAATGC 

AGAAAGAGGAATCAACTCTTTGGTATATATTGTAAAAGCAGAAGAATTTGCAAATGATACATTTTTGCTTGATGCA 

ATTGATGAGATTGCCTCAACAATAAGTATTTTCAAAAAAATAATAACAACCAACAACGAAAACATTGATAAT^ 

AAGAAAATAACAATACAAATGAATCAAATGAACAGCCCACCTTAAAGCAAGAAAAAACAAATTCAACAAAAGAATC 

TAATAACGAACTTAAAGAAGATCAAATAGAAGAAGAACTTCAAGAAATCAAAGCCCAATAA 

t219.nt 

TGCAATCATAAGAATATACAGTACGACAAGAGAATTAAAAAATTTTTAGATAAAAACAAAATTGAATATAAAATAG 

ACTCAGAAAATGACTTTATAGCATTTAAAGATATAAACAATAACGAAAAAGAAGAAGTAATCATCAGATCAAGACT 

AAACTCATATAAAAATTCAAAGATAAGAGAAATATTTGGAATTGTTAAAGTATTTGATATAAACACACCA^ 

AAAGAAATATCTGACTCGCTTATGAGCGATAGTTATAATAACAGAGTATTTGGATCGTGGGAGATTATTCATAATG 

CAGAAAGAGGAATCAACTCTTTGGTATATATTGTAAAAGCAGAAGAATTTGCAAATGATACAT^ 
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AATTGATGAGATTGCCTCAACAATAAGTATTTTCAAAAAAATAATAACAACCAACAACGAAAACATTGATAATAAT 
GAAGAAAATAACAATACAAATGAATCAAATGAACAGCCCACCTTAAAGCAAGAAAAAACAAATTCAACAAAAGAAT 
CTAATAACGAACTTAAAGAAGATCAAATAGAAGAAGAACTTCAAGAAATCAAAGCCCAATAA 

f229.aa 

MRVDLLPLVELSLYINLSFCCKDFSIFNRILEELKCHLILLGHPIIKTLYIKHVDFCLSRQDNLKFIFTSLSKYIN 

LELLEEFTLEIIPGYVDFEKFKLLDEFCITRINLlWQSFSLEFRKIVGIPEISYKKIJSflLINNIRKFPFDLJ^ 

VNMPLQKKSHLKRDLQRIAFIYAZ 

t229.aa 

CKDFSIFNRILEELKCHLILLGHPIIKTLYIKHVDFCLSRQDNLKFIFTSLSKYINLELLEEFTLEIIPGYVDFEK 
FKLLDEFCITRINLNVQSFSLEFRKIVGIPEISYKKLNILINNIRKFPFDLNIDMTVNMPLQKKSHLKRDLQRIAF 
lYAZ 

f229.nt 

ATGAGAGTAGATCTTTTACCTCTTGTCGAGTTAAGTCTTTATATTAATTTGTCATTTTGTTGTAAAGATTTTAGCA 
TTTTTAATAGAATTTTAGAGGAATTAAAATGTCATTTAATCTTGCTGGGTCATCCAATTATAAAAACACTTTACAT 
TAAGCACGTAGATTTTTGTTTATCTAGGCAAGATAATTTAAAATTTATTTTCACTTCTTTGTCCAAGTATATTAAT 
TTGGAGTTATTAGAAGAATTTACTTTAGAAATTATTCCGGGTTATGTTGATTTTGAAAAATTCAAACTTTTGGATG 
AATTTTGTATTACTAGAATTAATCTTAATGTTCAAAGTTTTTCTTTAGAGTTTAGAAAGATTGTGGGGATACCCGA 
AATTTCTTATAAAAAATTGAATATTTTGATTAACAATATTAGAAAGTTTCCTTTTGATTTGAATATTGACATGACT 
GTCAATATGCCTTTGCAAAAAAAATCTCATCTCAAGCGAGATTTGCAAAGAATTGCTTTCATATATGCCTGA 

t229 .nt 

TGTAAAGATTTTAGCATTTTTAATAGAATTTTAGAGGAATTAAAATGTCATTTAATCTTGCTGGGTCATCCAATTA 
TAAAAACACTTTACATTAAGCACGTAGATTTTTGTTTATCTAGGCAAGATAATTTAAAATTTATTTTCACTTCTTT 
GTCCAAGTATATTAATTTGGAGTTATTAGAAGAATTTACTTTAGAAATTATTCCGGGTTATGTTGATTTTGAAAAA 
TTCAAACTTTTGGATGAATTTTGTATTACTAGAATTAATCTTAATGTTCAAAGTTTTTCTTTAGAGTTTAGAAAGA 
TTGTGGGGATACCCGAAATTTCTTATAAAAAATTGAATATTTTGATTAACAATATTAGAAAGTTTCCTTTTGATTT 
GAATATTGACATGACTGTCAATATGCCTTTGCAAAAAAAATCTCATCTCAAGCGAGATTTGCAAAGAATTGCTTTC 
ATATATGCCTGA 

f 22 -aa 

MLKTLTKIITISCLIVGCASLPYTPPKQNL^^^LMELLPGANLYAHVNLIKNRSIYNSLSPKYKSVLGLISNLYFSY 
KKENNDFALLIMGNFPKDIFWGIHKNRNTESIGNIFTNPKWKLKJSISNIYIIPNKARTSIAITQKDITA^ 
KYIGEIEKNEMFFWIQDPTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVL 
TNMTNLTISSHIKTTIKDQNTVEIEFNIQKSSVESLIEKLASNIQT 

t22 ,aa 

CASLPYTPPKQNLNYLJ!4ELLPGANLYAHVNLIKNRSIYNSLSPKYKSVLGLISNLYFSYKKENNDFALLIMGN^ 
DIFWGIHKlvmNTESIGNIFTNPKWKLKNSNIYIIPNKARTSIAITQKDITAKDNlSM 

PTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVLTNMTNLTISSHIKTTIK 
DQNTVE I EFN I QKS S VES L I EKLASNIQT 

f22.nt 

ATGTTAAAAACATTAACAAAAATAATTACCATTTCATGCCTCATAGTGGGATGCGCAAGCCTGCCTTACACTCCTC 
CAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTATACGCCCATGTAAATTTAATTAAA 
CAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCTTATAAGCAATTTATACTl^ 
AAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAAGATATTTTCTGGGGA^ 
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ATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAACTTAAAAATTCAAATATATACATTAT 
TCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGCAAAAGACAATAATATGCTAACAACA 
AAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGATCCAACATTATTGCTCCCAAACCAAA 
TAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAAACAGCTTAAATCAAGAAGAATATAT 
TTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTCAAAAAAGTTAATTCCAACCGTCTTG 
ACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACGGTTGAAATAGAAT 
TTAATATTCAAAAATCTAGTGTTGAAAGCCTTATAGAAAAACTAGCTTCAAATATTCAAACCTAA 

t22 .nt 

TGCGCAAGCCTGCCTTACACTCCTCCAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTAT 
ACGCCCATGTAAATTTAATTAAAAACAGGTCTATTTATAACTCTTTT^GCCCTAAATATAAATCAGTTCTTGGGCT 
TATAAGCAATTTATACTTTAGCTATAAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAA 
GATATTTTCTGGGGAATTCATAAAAATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAAC 
TTAAAAATTCAAATATATACATTATTCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGC 
AAAAGACAATAATATGCTAACAACAAAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGAT 
CCAACATTATTGCTCCCAAACCAAATAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAA 
ACAGCTTAAATCAAGAAGAATATATTTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTC 
AAAAAAGTTAATTCCAACCGTCTTGACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAA 
GACCAAAATACGGTTGAAATAGAATTTAATATTCAAAAATCTAGTGTTGAAAGCCTTATAGAAAAACTAGCTTCAA 
ATATTCAAACCTAA 

£32. aa 

MNTKTLYLISLILLACNKNNKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEW 
KIEKTLEKTERYGIEGNWILVNYKGTKRYIFSKDINIVNNLIIDHSK 

t32 .aa 

CNKNNKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEWKIEKTLEKTERYGIE 
GNWILVNYKGTKRYIFSKDINIVNNLIIDHSK 

f32.nt 

ATGAATACAAAAACATTATATTTAATATCCTTAATTCTTTTAGCTTGCAATAAAAATAACAAAATTCCTCTCATTC 
AAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATAAAATGGGCATAATAATAAAAGATTATGCTTT 
TCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTACGCAATTCTACTCAGAAAAGACGAAGTCGTA 
AAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAAGGAAATTGGATCCTAGTCAATTACAAGGGAA 
CTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATTTAATAATTGATCATTCTAAATAG 

t32 .nt 

TGCAATAAAAATAACAAAATTCCTCTCATTCAAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATA 
AAATGGGCATAATAATAAAAGATTATGCTTTTCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTA 
CGCAATTCTACTCAGAAAAGACGAAGTCGTAAAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAA 
GGAAATTGGATCCTAGTCAATTACAAGGGAACTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATT 
TAATAATTGATCATTCTAAATAG 

fl86.aa 

MKKLIIIFTLFLSQACNLSTMHKIDTKEDMKILYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITHTL 

FGTTPMQRIHKYIX^SFNLTREILJVSGIELJSrRVVNAWU^SPSHKEALIOTDTDKIGGY 

YKN 




PCTAJS98/12718 
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CNLSTMHKIDTKEDMKILYSEIAELRKKLNLKTHLEIDDTLEKVAKEYAIKLGENRTITHTLFGTT 

PMQRI HKYDQS FNLTRE I LASG I ELNRWNAWLNS PSHKEAL I NTDTDKI GGYRLKTTDNI DI FWLFGKRKYKN 

fl86.nt 

ATGAAAAAATTGATTATAATTTTTACACTGTTTTTATCTCAAGCATGCAATTTAAGTACAATGCATAAAATAGATA 
CAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGAGAAAAAAATTAAATCTAAACCATCTAGAAAT 
AGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTAAACTGGGAGAAAATAGAACAATAACTCACACCCTT 
TTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCCTTTAATTTAACAAGAGAAATACTGGCATCAG 
GAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAAGCCACAAAGAAGCTCTTATTAATACAGATAC 
CGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGATATATTTGTAGTTCTTTTTGGAAAAAGAAAA 
TATAAGAATTGA 

tl86 .nt 

TGCAATTTAAGTACAATGCATAAAATAGATACAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGA 
GAAAAAAATTAAATCTAAACCATCTAGAAATAGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTAAACT 
GGGAGAAAATAGAACAATAACTCACACCCTTTTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCC 
TTTAATTTAACAAGAGAAATACTGGCATCAGGAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAA 
GCCACAAAGAAGCTCTTATTAATACAGATACCGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGA 
TATATTTGTAGTTCTTTTTGGAAAAAGAAAATATAAGAATTGA 

f216.aa 

MIRVLLGSLAVSFLFSIClWFIJSrYDNLFSKKVFYFHSSKGFVANLRYLRDEQNLKDNLDLLVKDFLLGSNEGFSFG 

FLLSDSRFLYSFLKNGVYYWLSREFYDSFNNGDYNESNESFDVKVNLFAMSLIKTMRFNYPGKIKKIVI^^ 

LKEQS 

t216 .aa 

CMVFLNYDNLFSKKVFYFHSSKGFVANLRYLRDEQNLKDNLDLLVKDFLLGSNEGFSFGFLLSDSRFLYSFLKNGV 
YYVNLSREFYDSFNNGDYNESNESFDVKWLFAMSLIKTMRFm:'PGKIKKIVILVEGCILKEQS 

f216.nt 

ATGATTAGGGTGCTTTTGGGGTCTTTGGCAGTAAGCTTTTTGTTTTCTATTTGTATGGTTTTTTTAAATTA 

ATCTTTTTTCAAAAAAGGTTTTTTATTTTCATTCTAGCAAGGGATTTGTTGCTAATTTAAGATATTTAAGAGATGA 

ACAAAATTTGAAAGATAATTTAGATCTTTTAGTAAAAGATTTTCTTTTAGGAAGCAATGAAGGGTTTTCTTTTGGG 

TTTTTATTAAGTGATTCAAGATTTTTATATTCTTTTTTAAAGAATGGAGTTTATTATGTAAATCTTTCAAGAGAAT 

TTTATGATTCTTTTAATAATGGTGATTATAATGAATCTAATGAATCTTTTGATGTTAAGGTCAATCTTTT^ 

GTCTTTAATAAAAACAATGCGCTTTAACTATCCTGGTAAGATAAAAAAGATTGTTATTCTTGTTGAAGGGTGTATC 

TTAAAGGAGCAAAGTTGA 

t216 .nt 

TGTATGGTTTTTTTAAATTATGATAATCTTTTTTCAAAAAAGGTTTTTTATTTTCATTCTAGCAAGGGATTTGTTG 

CTAATTTAAGATATTTAAGAGATGAACAAAATTTGAAAGATAATTTAGATCTTTTAGTAAAAGATTTTCTTTTAGG 

AAGCAATGAAGGGTTTTCTTTTGGGTTTTTATTAAGTGATTCAAGATTTTTATATTCTTTTTTAAAGAATGGAGT^ 

TATTATGTAAATCTTTCAAGAGAATTTTATGATTCTTTTAATAATGGTGATTATAATGAATCTAATGAATCTTTTG 

ATGTTAAGGTCAATCTTTTTGCTATGTCTTTAATAAAAACAATGCGCTTTAACTATCCTGGTAAGATAAAAAA^ 

TGTTATTCTTGTTGAAGGGTGTATCTTAAAGGAGCAAAGTTGA 

f328.aa 

MAIKYARE^^SrIPFLGICLGLQtAVIEFARNVCGILDADTEENLARDKPLKSPVIHLLPEQKGIKB 

VILKKNTIAFKliYGQDRIIERFRHRYEVNNDYIDLFAKNGLIVSGFSSDFKM^ 

lENPAKLFLGLIKACI 
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t328 .aa 

CI^LQLAVIEFARNVCGILDAm'EENLARDKPLKSPVIHLLPEQKGIKDKGATMRLGGyPVILKK^ 

RIIERFRHRYEVNNDYIDLFAKNGLIVSGFSSDFKMAKLIEIPENKFFVACQFHPELITRIENPAKLF 

LGLIKACI 

£328. nt 

ATGGCTATTAAATATGCTCGTGAGAATAATATTCCCTTTCTTGGAATTTGTCTTGGTTTGCAGCTTGCTGTAATAG 
AATTTGCTCGTAATGTTTGTGGAATACTTGATGCTGATACGGAGGAAAATTTAGCAAGAGACAAGCCCTTAAAAAG 
TCCTGTTATCCATTTACTTCCTGAGCAAAAGGGAATTAAAGATAAGGGCGCTACAATGAGGCTTGGTGGATATCCT 
GTGATTCTTAAAAAGAATACAATAGCTTTTAAACTTTATGGCCAAGATCGGATAATTGAAAGATTTAGACATAGGT 
ATGAAGTCAATAATGATTATATAGATTTATTTGCAAAAAATGGGCTTATAGTATCTGGATTTTCAAGTGATTTTAA 
AATGGCAAAATTAATAGAAATTCCTGAAAATAAATTTTTCGTAGCTTGCCAGTTTCATCCAGT^CTTATTACAAGA 
ATAGAAAATCCAGCCAAGCTTTTTCTAGGATTAATTAAAGCTTGTATTTGA 

t328.nt 

TGTCTTGGTTTGCAGCTTGCTGTAATAGAATTTGCTCGTAATGTTTGTGGAATACTTGATGCTGATACGGAGGAAA 
ATTTAGCAAGAGACAAGCCCTTAAAAAGTCCTGTTATCCATTTACTTCCTGAGCAAAAGGGAATTAAAGATAAGGG 
CGCTACAATGAGGCTTGGTGGATATCCTGTGATTCTTAAAAAGAATACAATAGCTTTTAAACTTTATGGCCAAGAT 
CGGATAATTGAAAGATTTAGACATAGGTATGAAGTCAATAATGATTATATAGATTTATTTGCAAAAAATGGGCTTA 
TAGTATCTGGATTTTCAAGTGATTTTAAAATGGCAAAATTAATAGAAATTCCTGAAAATAAATTTTTCGTAGCTTG 
CCAGTTTCATCCAGAACTTATTACAAGAATAGAAAATCCAGCCAAGCTTTTTCTAGGATTAATTAAAGCTTGTATT 
TGA 

f352.aa 

mnktknrsltyfiilscislfgannntisyssieipledlseefkssgnksdqintskhlnknivsyedpkkgkdl 
klpenirdkklpqkrmdendlksvienyenkikniekllktknqktsenenkkiesiekkakkyeiltnklkneiv 
eikkllnkkikpkedenyekinienieeetdddfednyeym:>eiextnednypsnegiinnlkenlnenekyyain 
ekkideledrinenentildlqrelrnfkkkdnsdknleeieenlssigriindlkrkisaneainkenqkkirtd 
khklkeledkikeneetilklqkelnnfkkkeiyqkplneetftpsitsknddleenkklkkeylkpiekkesrdl 
eentkstpkttmiktadfqiypdiylnnykfkekgdqfafkkentyyieidptnnlnealknheiiskykfekyfi 
npilknkeeffrnlievknihelgimyknlkpefkqikiik 

t352 .aa 

CISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSKHLNKNIVSYEDPKKGKDLKLPENIRDKKLPQKRM 

DENDLKSVIENYENKIKNIEKLLKTKNQKTSENENKKIESIEKKAKKYEILTNKLKNEIVTIIKKLLNKKIKPK^ 

NYEKINIENIEEETDDDFEDlSnirEYNDEIEXTNEDNYPSNEGIimLKENIJSrENEKYY^ 

tildlqrelrnfkkkdnsdknleeieenlssigriindlkrkisaneainkenqkkirtdkhklkeledkikenee 
tilklqkelimfkkkeiyqkplneetftpsitsknddleenkklkkeylkpiekkesrdleentkstpkttmikta 
dfqiypdiylnnykfkekgdqfafkkentyyieidptnnlnealknheiiskykfekyfinpilknkeeff^ 
vknihelgimyknlkpefkqiki ik 

f352.nt 

ATGAATAAAACAAAAAATCGAAGCCTTACGTATTTTATAATACTTTCATGTATATCATTATTTGGGGCTAATAATA 

ATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTGAAGAATTTAAAAGTTCTGGGAATAAAAG 

CGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTATGAAGACCCAAAAAAGGGTAA^ 

AAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATGGACGAAAATGATCTAAAATCTGTAATTC 

AAAATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAAAAACCAAAAATCAAAAAACATCGGAAAAT^^ 

TAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAAATTTTAACCAAT/^AATTAA 

GAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATGAAAATTACGAAAAAATAAATATTC 
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TTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATGATGAAATTGAAGAACAAATGAC^ 

TACCCTTCTAATGAAGGAATAATAAACAATCTAAAAGAAAATCTTAATGAAAACGAAAAATATTATGCTATTAATG 

AAAAAAAAATCGATGAACTTGAAGACAGAATCAACGAGAATGAAAACACTATTTTAGACTTGCAAAGAGAATTAAG 

GAATTTTAAAAAAAAAGATAACTCAGATAAAAACTTAGAAGAAATTGAGGAAAATTTATCTTCAATAGGAAGAATA 

ATTAATGATCTAAAAAGAAAAATCAGCGCAAATGAAGCAATAAACAAAGAAAATCAAAAAAAAATAAGAACTGATA 

AACACAAACTCAAAGAATTAGAAGATAAAATAAAGGAAAATGAAGAGACTATTTTAAAACTTCAAAAAGAATTA^ 

CAATTTTAAAAAAAAAGAAATTTATCAAAAACCCTTAAATGAAGAAACTTTCACTCCAAGCATTACAAGTAAAAAT 

GACGACTTAGAAGAAAATAAGAAATTAAAAAAGGAATATTTAAAGCCCATAGAAAAAAAAGAAAGCCGAGATCTAG 

AAGAAAATACTAAAAGCACCCCAAAAACAACTATGATAAAAACAGCAGATTTTCAAATCTACCCTGACATATATCT 

TAATAATTATAAATTTAAAGAAAAGGGAGATCAATTTGCATTTAAAAAAGAAAACACATACTATATTGAAATAGAT 

CCCACTAACAATTTAAATGAGGCTTTAAAAAATCATGAAATAATCTCAAAATATAAATTTGAAAAATATTTCATTA 

ACCCTATTCTAAAAAATAAAGAAGAATTTTTTAGAAACTTAATAGAAGTCAAAAATATCCACGAACTAGGAATTAT 

GTATAAAAATCTAAAGCCTGAATTTAAGCAAATAAAAATAATTAAATAA 

t352 .nt 

TGTATATCATTATTTGGGGCTAATAATAATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTG 

AAGAATTTAAAAGTTCTGGGAATAAAAGCGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTA 

TGAAGACCCAAAAAAGGGTAAAGATCTAAAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATG 

GACGAAAATGATCTAAAATCTGTAATTGAAAATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAAAAACCA 

AAAATCAAAAAACATCGGAAAATGAAAATAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAATATGAA^ 

AACCAATAAATTAAAAAACGAAATAGTAGAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGATGAA 

AATTACGAAAAAATAAATATTGAAAACATTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATG 

ATGAAATTGAAGAACA/^TGAGGACAATTACCCTTCTAATGAAGGAATAATAAACAATCTAAAAGAAAATCTTAAT 

GAAAACGAAAAATATTATGCTATTAATGAAAAAAAAATCGATGAACTTGAAGACAGAATCAACGAGAATGAAAACA 

CTATTTTAGACTTGCAAAGAGAATTAAGGAATTTTAAAAAAAAAGATAACTCAGATAAAAACTTAGAAGAAATTGA 

GGAAAATTTATCTTCAATAGGAAGAATAATTAATGATCTAAAAAGAAAAATCAGCGCAAATGAAGCAATAAACAAA 

GAAAATCAAAAAAAAATAAGAACTGATAAACACAAACTCAAAGAATTAGAAGATAAAATAAAGGAAAATGAAGAGA 

CTATTTTAAAACTTCAAAAAGAATTAAACAATTTTAAAAAAAAAGAAATTTATCAAAAACCCTTAAATGAAGAAAC 

TTTCACTCCAAGCATTACAAGTAAAAATGACGACTTAGAAGAAAATAAGAAATTAAAAAAGGAATATTTAAAGCCC 

ATAGAAAAAAAAGAAAGCCGAGATCTAGAAGAAAATACTAAAAGCACCCCAAAAACAACTATGATAAAAACAGCAG 

ATTTTCAAATCTACCCTGACATATATCTTAATAATTATAAATTTAAAGAJ\AAGGGAGATCAATTTGCATTTAAAAA 

AGAAAACACATACTATATTGAAATAGATCCCACTAACAATTTAAATGAGGCTTTAAAAAATCATGAAATAATCTCA 

AAATATAAATTTGAAAAATATTTCATTAACCCTATTCTAAAAAATAAAGAAGAATTTTTTAGAAACTTAATAGAAG 

TCAAAAATATCCACGAACTAGGAATTATGTATAAAAATCTAAAGCCTGAATTTAAGCAAATAAAAATAATTAAATA 

A 

f867.aa 

M1^KGKWGVNGNLWIEVEGSVS^1NEVLFVKTAGRNLKAEVIRIRGNEVDAQVFELTKGISVGDLVEF^ 

ELGPGLLTQWDGLQNPLPELAIQCGFFLERGVYLRPLNKDKKWNFKKTSKVGDIVIAGDFLGFVIEGTVHHQIMI 

PFYKRDSYKIVEIVSDGDYSIDEQIAVIEDDSGMRHNITMSFHWPVKVPITNYKERLIPSEPMLTQTRIIDTFFPV 

AKGGTFCI PGPFGAGKTVLQQVTSKNADVDWI lAACGERAGEWETLKEFPEI^DPKTGKSIJ!^^^ 

PVAAREASVYTAITIGEYYRQMGLDILLLADSTSRWAQAMREMSGRLEEIPGEEAFPAYLESVIASFYERAGIWIi 

NNGDIGSVTVGGSVSPAGGNFEEPVTQATLKWGAFHGLTRERSDARKFPAISPIiESWSKYKGVIDQKKTEYARSF 

LVKGNEINQMMKWGEEGISNDDFLIYLKSELLDSCYLQQNSFDSIDAAVSSERQNYMFDIVYNILK^ 

QARDFINELRQNLLDMNIiSSFKDHKFNKLEHALGELINFKKVI 

t867 .aa 

grnlkaevirirgnevdaqvfeltkgisvgdlveftdklltvelgpglltqwixslqnplpeiiaiqcgfflergw 

lrplnkdkkwnfkictskvgdiviagdfu3fviegtvhhqimipfykrdsykiveivsix3dysideqia 

rhnitmsfhwpvkvpitnykerli psepmltqtri idtffpvakggtfci pgpfgagktvlqqvtsrna^ 

aacgeragevvt:tlk^fpelmdpktgkslmdrtciicntssmpvaareasw 

rwaqamremsgrleeipgeeafpaylesviasfyeragiwlnngdigsvwggsvspac^ 

afhgltrersdarkfpaispleswskykgvidqkkteyarsflvkgneinqmmkwgeegisnddfliylkselld 
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SCYLQQNSFDSIDAAVSSERQNYMFDIVYNILKTNFEFSDKLQARDFINELRQNLLDMNLSSFKDHKFNKLEHALG 
ELINFKKVI 

f867.nt 



ATGAATACAAAAGGAAAAGTCGTTGGAGTTAATGGAAACTTAGTTACTATTGAGGTAGAAGGTTCAGTTTCTATGA 

ATGAAGTTTTATTTGTAAAGACTGCTGGTAGGAATTTAAAAGCAGAAGTAATTCGTATTAGGGGCAATGAAGTTGA 

TGCACAGGTTTTTGAATTGACAAAAGGGATATCTGTTGGAGACCTAGTTGAATTTACAGACAAACTTTTAACAGTT 

GAACTCGGACCAGGGCTTTTAACTCAAGTATATGATGGGCTTCAAAATCCTTTGCCTGAATTGGCTATTCAATGTG 

GATTTTTTTTAGAAAGGGGAGTATATTTAAGGCCCTTGAATAAAGATAAAAAGTGGAATTTTAAAAAAACCTCCAA 

AGTTGGAGATATCGTTATTGCAGGAGATTTTTTAGGTTTTGTAATTGAGGGAACTGTTCACCATCAAATAATGATT 

CCATTTTATAAAAGGGATTCTTATAAAATTGTGGAGATTGTAAGTGATGGCGACTATTCGATTGATGAGCAAATTG 

CTGTAATTGAAGATGATTCTGGTATGAGGCATAATATTACAATGTCTTTTCATTGGCCTGTTAAAGTTCCTATTAC 

TAATTATAAGGAACGCCTTATTCCTAGTGAACCTATGTTGACTCAAACTAGAATTATAGATACATTTTTCCCAGTT 

GCCAAAGGTGGAACTTTTTGCATTCCGGGTCCTTTTGGAGCAGGAAAAACGGTTCTTCAGCAGGTTACAAGTCGAA 

ATGCTGATGTTGATGTAGTGATTATTGCAGCTTGTGGTGAGCGAGCAGGAGAAGTGGTAGAAACTCTTAAAGAATT 

TCCCGAATTAATGGATCCAAAAACCGGCAAATCTTTAATGGACAGGACTTGTATTATTTGTAATACATCTTCAATG 

CCAGTTGCAGCTAGAGAAGCTTCTGTTTATACTGCTATTACTATTGGTGAGTATTACAGGCAAATGGGCCTTGATA 

TTCTTCTTTTGGCAGATTCAACTTCAAGATGGGCTCAAGCAATGAGAGAAATGTCTGGACGCCTTGAGGAAATTCC 

TGGCGAGGAGGCTTTTCCGGCATATCTTGAGTCTGTTATTGCTTCCTTTTATGAAAGGGCAGGTATTGTAGTTCTT 

AATAATGGGGATATTGGATCTGTAACAGTTGGTGGCTCTGTAAGTCCTGCTGGTGGTAATTTTGAAGAGCCAGT^ 

CTCAAGCAACTTTAAAAGTTGTAGGAGCATTTCACGGGCTTACAAGAGAAAGGTCTGATGCTAGGAAATTTCCAGC 

TATTAGTCCTCTTGAATCTTGGAGTAAATATAAAGGCGTTATTGATCAAAAAAAGACTGAATATGCAAGATCTTTT 

TTGGTGAAAGGTAATGAAATTAATCAAATGATGAAAGTTGTTGGAGAAGAAGGCATAAGTAACGATGATTTTTTAA 

TTTATTTAAAATCCGAGCTACTTGATTCGTGCTATTTGCAGCAAAATTCATTTGATTCTATTGATGCTGCTGTTAG 

TTCAGAGCGTCAAAATTATATGTTTGATATAGTTTATAACATTCTTAAAACTAACTTTGAGTTTTCTGATAAACTT 

CAAGCAAGAGATTTTATAAATGAGTTAAGGCAAAATCTTTTAGACATGAATCTTTCTTCTTTTAAGGATCATAAGT 

TTAATAAATTGGAGCATGCTTTGGGTGAATTGATAAATTTTAAAAAGGTAATTTAG 

t867 .nt 

GGTAGGAATTTAAAAGCAGAAGTAATTCGTATTAGGGGCAATGAAGTTGATGCACAGGTTTTTGAATTGACAAAAG 
GGAT 

ATCTGTTGGAGACCTAGTTGAATTTACAGACAAACTTTTAACAGTTGAACTCGGACCAGGGCTTTTAACTCAAGTA 

TATGATGGGCTTCAAAATCCTTTGCCTGAATTGGCTATTCAATGTGGATTTTTTTTAGAAAGGGGAGTATATTTAA 

GGCCCTTGAATAAAGATAAAAAGTGGAATTTTAAAAAAACCTCCAAAGTTGGAGATATCGTTATTGCAGGAGATTT 

TTTAGGTTTTGTAATTGAGGGAACTGTTCACCATCAAATAATGATTCCATTTTATAAAAGGGATTCTTATAAAATT 

GTGGAGATTGTAAGTGATGGCGACTATTCGATTGATGAGCAAATTGCTGTAATTGAAGATGATTCTGGTATGAGGC 

ATAATATTACAATGTCTTTTCATTGGCCTGTTAAAGTTCCTATTACTAATTATAAGGAACGCCTTATTCCTAGTGA 

ACCTATGTTGACTCAAACTAGAATTATAGATACATTTTTCCCAGTTGCCAAAGGTGGAACTTTTTGCATTCCGGGT 

CCTTTTGGAGCAGGAAAAACGGTTCTTCAGCAGGTTACAAGTCGAAATGCTGATGTTGATGTAGTGATTATTGCAG 

CTTGTGGTGAGCGAGCAGGAGAAGTGGTAGAAACTCTTAAAGAATTTCCCGAATTAATGGATCCAAAAACCGGCAA 

ATCTTTAATGGACAGGACTTGTATTATTTGTAATACATCTTCAATGCCAGTTGCAGCTAGAGAAGCTTCTGTTTAT 

ACTGCTATTACTATTGGTGAGTATTACAGGCAAATGGGCCTTGATATTCTTCTTTTGGCAGATTCAACTTCAAGAT 

GGGCTCAAGCAATGAGAGAAATGTCTGGACGCCTTGAGGAAATTCCTGGCGAGGAGGCTTTTCCGGCATATCTTGA 

GTCTGTTATTGCTTCCTTTTATGAAAGGGCAGGTATTGTAGTTCTTAATAATGGGGATATTGGATCTGTAACAGTT 

GGTGGCTCTGTAAGTCCTGCTGGTGGTAATTTTGAAGAGCCAGTTACTCAAGCAACTTTAAAAGTTGTAGGAGCAT 

TTCACGGGCTTACAAGAGAAAGGTCTGATGCTAGGAAATTTCCAGCTATTAGTCCTCTTGAATCTTGGAGTAAATA 

TAAAGGCGTTATTGATCAAAAAAAGACTGAATATGCAAGATCTTTTTTGGTGAAAGGTAATGAAATTAATCAAATC 

ATGAAAGTTGTTGGAGAAGAAGGCATAAGTAACGATGATTTTTTAATTTATTTAAAATCCGAGCTACTTGAOT 

GCTATTTGCAGCAAAATTCATTTGATTCTATTGATGCTGCTGTTAGTTCAGAGCGTCAAAATTATATGTTTGATAT 

AGTTTATAACATTCTTAAAACTAACTTTGAGTTTTCTGATAAACTTCAAGCAAGAGATTTTATA/^TGAGTTAAGG 

CAAAATCTTTTAGACATGAATCTTTCTTCTTTTT^GGATCATAAGTTTAATAAATTGGAGCATGCT^ 

TGATAAATTTTAAAAAGGTAATTTAG 
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f 868 .aa 

MKRVYSKIESIAGNVITWAQGIKYGEIiAIVKAKDTSSLAEVIKLDREKVSLQVYGGTRGVSTSDEIKFLGHS 
SFSDNLIX3RIFEK3SGNPRDGGPSLDDNLIEIGGPSANPTKRIVPR1MIRTGLPMIDVFNTLVESQKLPIFSVSGE 
YNELLIRI ALQAEVDL 1 1 LGGMGLKHDDYLTFKDSLEKGGALSRAI FFVHTANDS WESLTVPDI SLSVAEKFALK 
GKKVLVLLTDMTNFADAMKE I S I TMEQVP SNRG YPGDLYSQLiAYRYEKAI DFEGAG S I T I li AVTTM P 
DNTGYITEGQYYLKGGRIEPFGSLSRLKQMVNSRTRDDHRTIMDSMIKLYASSKESVEKKAMGFNMTKWDEKL 
SNMFESKMMDLSVNIPLEEALDLGWSILASCFSPKETGIKTDLIEKYWPKKETY 

t868 .aa 

QGIKYGELAIVKAKDTSSIJ^EVIKLDREKVSLQVYGGTRGVSTSDEIKFLGHSMQVSFSDNLLGRIFDGSGNPRDG 

GPSLDDNLIEIGGPSANPTKRIVPKNMIRTGLPMIDVFNTLVESQKLPIFSVSGEPYNELLIRIALQAEVDLIILG 

GMGLKHDDYLTFKDSLEKGGALSRAIFFVHTANDSWESLTVPDISLSVAEKFALKGKKVLVLLTDMTNFADAMKE 

ISITMEQVPSNRGYPGDLYSQLAYRYEKAIDFEGAGSITILiAVTTMPGDDVTHPVPDNTGYITEGQYYLKGGRIEP 

FGSLSRLKQMVNSRTRDDHRTIMDSMIKLYASSKESVEKKAMGFl^TKWDEKLLKYSNMFESK^^ 

LDLGWS I LASC F S PKETG I KTDL I EK YWPKKETY 

f868.nt 

ATGAAAAGAGTCTATAGTAAAATAGAGTCTATAGCAGGCAATGTAATAACTGTTACAGCTCAAGGTATTAAGTATG 
GTGAGCTTGCTATTGTAAAAGCAAAAGATACAAGTTCTCTAGCCGAAGTAATTAAACTTGATCGAGAAAAAGTTTC 
TCTTCAGGTTTATGGTGGTACAAGAGGTGTTTCCACGTCAGACGAGATAAAGTTTTTAGGGCATTCAATGCAGGTT 
TCATTTTCTGACAATTTGTTGGGCAGAATTTTTGATGGTTCTGGGAATCCTAGAGATGGGGGCCCTTCTCTTGATG 
ATAATTTGATTGAAATTGGTGGGCCTTCTGCAAATCCTACAAAACGCATTGTTCCTAGAAATATGATAAGGACAGG 
GCTTCCAATGATAGATGTTTTTAATACTCTTGTTGAATCTCAJ\AAATTGCCAATTTTTTCTGTTTCTGGTGAGCCT 
TATAATGAGCTTCTTATAAGAATTGCACTTCAAGCAGAAGTTGATTTAATAATTCTTGGCGGAATGGGACTTAAGC 
ATGATGATTATTTAACTTTTAAAGATTCTTTAGAAAAGGGAGGTGCTTTAAGTAGAGCAATTTTTTTTGTTCATAC 
TGCTAATGATTCTGTTGTTGAATCTTTAACTGTTCCTGATATTTCACTTTCTGTTGCTGAAAAGTTTGCTCTAAAG 
GGCAAAAAAGTTTTGGTGCTTCTCACAGACATGACAAATTTTGCTGATGCAATGAAAGAAATATCTATTACAATGG 
AACAAGTGCCTTCTAATAGAGGTTATCCCGGGGATTTGTATTCTCAGCTTGCATATCGTTATGAGAAGGCTATTGA 
CTTTGAAGGCGCAGGATCAATTACAATACTTGCAGTTACAACAATGCCGGGTGACGATGTTACTCATCCTGTTCCT 
GACAATACTGGATACATTACAGAAGGTCAATACTATTTAAAAGGTGGCAGAATAGAGCCTTTTGGGTCTCTTTCAA 
GACTTAAGCAAATGGTAAATAGTAGAACTAGAGACGATCACAGGACTATAATGGATTCAATGATCAAGCTTTATGC 
ATCTTCAAAAGAGTCTGTAGAAAAAAAGGCTATGGGATTTAATATGACTAAGTGGGATGAAAAATTGCTCAAGTAT 
AGCAATATGTTTGAAAGTAAGATGATGGATTTGTCTGTTAATATTCCTTTAGAAGAGGCTTTAGATTTAGGTTGGA 
GCATTCTTGCTAGTTGTTTTAGCCCAAAAGAAACGGGAATAAAAACAGATCTTATTGAAAAATATTGGCCTAAAAA 
AGAGACTTATTGA 

t868 .nt 

CAAGGTATTAAGTATGGTGAGCTTGCTATTGTAAAAGCAAAAGATACAAGTTCTCTAGCCGAAGTAATTAAACTTG 
ATCGAGAAAAAGTTTCTCTTCAGGTTTATGGTGGTACAAGAGGTGTTTCCACGTCAGACGAGATAAAGTTTTTAGG 
GCATTCAATGCAGGTTTCATTTTCTGACAATTTGTTGGGCAGAATTTTTGATGGTTCTGG^ 

GGCCCTTCTCTTGATGATAATTTGATTGAAATTGGTGGGCCTTCTGCAAATCCTACAAAACGCATTGTTCCTAGAA 
ATATGATAAGGACAGGGCTTCCAATGATAGATGTTTTTAATACTCTTGTTGAATCTCAAAAATTGCCAATTTTTTC 
TGTTTCTGGTGAGCCTTATAATGAGCTTCTTATAAGAATTGCACTTCAAGCAGAAGTTGATTTAATAATTCTTGGC 
GGAATGGGACTTAAGCATGATGATTATTTAACTTTTAAAGATTCTTTAGAAAAGGGAGGTGCTTTAAGTAGAGCAA 
TTTTTTTTGTTCATACTGCTAATGATTCTGTTGTTGAATCTTTAACTGTTCCTGAT^^ 

AAAGTTTGCTCTAAAGGGCAAAAAAGTTTTGGTGCTTCTCACAGACATGACAAATTTTGCTGATGCAATGAAAGAA 

ATATCTATTACAATGGAACAAGTGCCTTCTAATAGAGGTTATCCCGGGGATTTGTATTCTCAGCTTGCATATCG^ 

ATGAGAAGGCTATTGACTTTGAAGGCGCAGGATCAATTACAATACTTGCAGTTACAACAATGCCGGGTGACGATC 

TACTCATCCTGTTCCTGACAATACTGGATACATTACAGAAGGTCAATACTATTTAAAAGGTGGCAGAATAGAGCCT 

TTTGGGTCTCTTTCAAGACTTAAGCAAATGGTAAATAGTAGAACTAGAGACGATCAC^ 

TGATCAAGCTTTATGCATCTTCAAAAGAGTCTGTAGAAAAAAAGGCTATGGGATTO 

AAAATTGCTCAAGTATAGCAATATGTTTGAAAGTAAGATGATGGATTTGTCTGTTA^ 
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TTAGATTTAGGTTGGAGCATTCTTGCTAGTTGTTTTAGCCCAAAAGAAACGGGAATAAAAACAGATCTTATTGAAA 
AATATTGGCCTAAAAAAGAGACTTATTGA 

f 872 .aa 

MRSAVLFFFALPFSISLYSSSNKNFPYWILLEKGRQFLYSKSEFSKSNLTHAINYLQEALLRKGVYPEASYYLSVA 
YGMSGNAILEKLNLYKSFEDRYYLLDESFEKKILFSLAKI^ELENNYVDTIDYLITO 

ENSMSNNELNASFYLTSYLKQWGAFGIDFTFNLYRFKNYWIDTHQLLSKVYLHLKAYELSITHGLIAAVG 

MYDYVCYYEPWQFKNLRSFVQKINKYKAIKNAFESTDFWEIVYWAAATYAYSNGlSri^KFR^ 

SPYIAKSRSQIKNSVYLKKN 

t872 .aa 

SNKNFPYWILLEKGRQFLYSKSEFSKSNLTHAINYLQEALLRKGVYPEASYYLSVAYGMSGNAILEKLNLYKSFED 
RYYLLDESFEKKILFSLAKMAELENNYVDTIDYLNDILNKFSTKKDYYSYHDYSQGENSMSNNEI^ 
QVRGAFGIDFTFNLYRFKNYl^IDTHQLLSKVYLHLKAYELSITHGLIAAVGILTRMYDYVCYYEPVYQFKNL^ 
VQKINKYKAIKNAFESTDFWEIVYNVAAATYAYSNGNYKFRAIDTWKLVVDI^PRFSPYIAKSRSQIKNSW 

f 872 .nt 

ATGAGAAGTGCGGTTTTATTTTTTTTTGCTTTGCCTTTTTCTATTTCTTTGTATTCTTCAAGTAATAAAAATTTTC 
CGTATTGGATTTTACTTGAAAAAGGCAGGCAATTTCTTTATTCTAAATCTGAATTTAGTAAGTCTAATCTTACACA 
TGCTATTAATTATTTGCAGGAAGCTTTGCTTAGAAAAGGCGTTTATCCTGAGGCTAGTTATTATTTGTCAGTAGCT 
TATGGTATGTCTGGCAATGCTATTCTTGAAAAATTAAACCTTTATAAGTCTTTTGAAGACAGATATTATTTGCTAG 
ATGAATCTTTTGAAAAAAAAATACTTTTTTCTTTAGCTAAAATGGCTGAACTTGAGAATAATTATGTTGATACTAT 
TGATTATTTGAATGACATATTAAATAAGTTTTCAACTAAAAAAGATTATTATAGTTATCATGATTATTCTCAAGGC 
GAAAACAGTATGTCAAATAATGAACTTAATGCTTCATTTTATTTAACTTCTTATTTAAAACAAGTAAGAGGAGCTT 
TTGGTATTGATTTTACTTTTAATCTTTACAGATTTAAAAACTACAATGTTATTGATACTCATCAATTATTGTCAAA 
AGTTTATTTGCACTTAAAAGCTTATGAGCTTTCAATTACTCATGGACTTATAGCTGCAGTAGGAATTTTAACAAGA 
ATGTATGATTATGTTTGTTATTATGAACCTGTGTATCAGTTTAAAAATTTAAGGTCTTTTGTTCAAAAAATTAATA 
AGTATAAGGCAATAAAAAATGCTTTTGAATCTACAGATTTTTGGGAAATAGTTTATAATGTTGCTGCTGCTACTTA 
TGCATATTCTAATGGCAATTATAAATTTAGAGCAATAGATACTTGGAAATTAGTAGTAGATCTTGCGCCAAGGTTT 
TCTCCTTATATTGCTAAATCAAGAAGTCAAATTAAAAATTCTGTATATTTAAAAAAAAATTAA 

t872 .nt 

AGTAATAAAAATTTTCCGTATTGGATTTTACTTGAAAAAGGCAGGCAATTTCTTTATTCTAAATCTGAATTTAGTA 

AGTCTAATCTTACACATGCTATTAATTATTTGCAGGAAGCTTTGCTTAGAAAAGGCGTTTATCCTGAGGCTAGTTA 

TTATTTGTCAGTAGCTTATGGTATGTCTGGCAATGCTATTCTTGAAAAATTAAACCTTTATAAGTCTTTTGAAGAC 

AGATATTATTTGCTAGATGAATCTTTTGAAAAAAAAATACTTTTTTCTTTAGCTAAAATGGCTGAACTTGAGAATA 

ATTATGTTGATACTATTGATTATTTGAATGACATATTAAATAAGTTTTCAACTAAAAAAGATTATTAT^ 

TGATTATTCTCAAGGCGAAAACAGTATGTCAAATAATGAACTTAATGCTTCATTTTATTTAACTTCTTATTTAAAA 

CAAGTAAGAGGAGCTTTTGGTATTGATTTTACTTTTAATCTTTACAGATTTAAAAACTACAATGTTATTGATACTC 

ATCAATTATTGTCAAAAGTTTATTTGCACTTAAAAGCTTATGAGCTTTCAATTACTCATGGACTTATAGCTG 

AGGAATTTTAACAAGAATGTATGATTATGTTTGTTATTATGAACCTGTGTATCAGTTTAAAAATTTAAGGTCTTTT 

GTTCAAAAAATTAATAAGTATAAGGCAATAAAAAATGCTTTTGAATCTACAGATTTTTGGGAAATAGTTTATAATG 

TTGCTGCTGCTACTTATGCATATTCTAATGGCAATTATAAATTTAGAGCAATAGATACTTGGAAATTAGTAGTAGA 

TCTTX3CGCCAAGGTTTTCTCCTTATATTGCTAAATCAAGAAGTCAAATTAAAAATTCTGTATATTTAAAAAAAAAT 

TAA 

f874.aa 

MLKSNKWLIGAGGVGSSFAYALTIDNSLVHELVIIDWENKAKGEVMDIJSrHGQMFLK 

WITAGLNQKPGETRLDLVDKNSKIFKDIITNWSSGFDGIFWASNPVDIMTYVTM^ 

SRLRYFLiSDHFNVNTQNIHSYIMGEHXDSSFATWDETKIAMKPLSEYIiAEGKITELELDEIH^^ 
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GATYYAIGLGIKNIVNAIIGDQNVILPISSYINGQYGGLIKDIYIGAPAIVCKEGVKEVLNFKISPKELDKFNSSA 
NQLKSYIDKMEF 

t874 .aa 

ALTIDNSLVHELVIIDWENKAKGEVMDLNHGQMFLKKNINVLFGTYKDCANADIWITAGLNQKPGETRLDL^ 
NSKIFKDIITIWVSSGFDGIFWASNPVDIMTYWMKYSKFPIHKVIGTGTILDTSRLRYFLSDHFN^ 
IMGEHXDSSFATWDETKIAMKPLSEYIAEGKITELELDEIHKKVVNAAYEVIKLKGATYYAIG]^IKNI^^ 
QNVILPISSYINGQYGGLIKDIYIGAPAIVCKEGVKEVLNFKISPKELDKFNSSANQLKSYIDKMEF 

f874.nt 

ATGCTTAAGTCTAATAAAGTTGTTCTTATTGGAGCTGGTGGGGTTGGTTCAAGCTTTGCGTATGCTTTAACAATAG 
ACAATTCACTTGTACATGAACTTGTAATTATTGATGTTAATGAAAATAAAGCAAAAGGGGAGGTCATGGACCTTAA 
TCATGGCCAAATGTTTTTAAAGAAGAATATTAATGTATTGTTTGGGACTTACAAAGATTGTGCTAATGCAGATATT 
GTTGTAATTACAGCAGGACTTAATCAAAAGCCTGGTGAGACAAGACTTGATTTGGTTGATAAAAATTCTAAAATTT 
TTAAAGATATTATAACTAATGTTGTATCTAGCGGTTTTGATGGTATTTTTGTTGTTGCAAGCAATCCTGTAGACAT 
TATGACTTATGTTACAATGAAATATTCCAAATTTCCTATTCATAAGGTTATTGGTACTGGGACTATTCTTGATACT 
TCAAGACTTAGATATTTTTTAAGTGATCATTTTAATGTGAACACTCAAAATATACATTCATATATTATGGGTGAGC 
ACGTGACAGTTCTTTTGCTACGTGGGATGAAACAAAAATAGCAATGAAGCCTTTGTCAGAATATCTTGCTGAAGGC 
AAAATAACTGAGTTGGAGCTTGATGAAATTCATAAAAAGGTTGTGAATGCTGCTTATGAAGTTATTAAGTTAAAGG 
GGGCAACCTATTATGCTATTGGACTTGGTATTAAGAATATTGTAAATGCAATAATTGGAGATCAGAATGTTATTCT 
GCCAATATCTTCTTATATTAATGGCCAGTATGGGGGATTGATTAAAGATATTTATATTGGAGCGCCTGCTATAGTT 
TGTAAGGAAGGAGTCAAAGAAGTTTTAAACTTTAAGATAAGCCCTAAAGAGCTTGATAAGTTTAATAGTTCTGCTA 
ATCAGCTTAAAAGCTATATTGATAAAATGGAATTTTAG 

t874.nt 

GCTTTAACAATAGACAATTCACTTGTACATGAACTTGTAATTATTGATGTTAATGAAAATAAAGCAAAAGGGGAGG 
TCATGGACCTTAATCATGGCCAAATGTTTTTAAAGAAGAATATTAATGTATTGTTTGGGACTTACAAAGATTGTGC 
TAATGCAGATATTGTTGTAATTACAGCAGGACTTAATCAAAAGCCTGGTGAGACAAGACTTGATTTGGTTGATAAA 
AATTCTAAAATTTTTAAAGATATTATAACTAATGTTGTATCTAGCGGTTTTGATGGTATTTTTGTTGTTGCAAGCA 
ATCCTGTAGACATTATGACTTATGTTACAATGAAATATTCCAAATTTCCTATTCATAAGGTTATTGGTACTGGGAC 
TATTCTTGATACTTCAAGACTTAGATATTTTTTAAGTGATCATTTTAATGTGAACACTCAAAATATACATTCATAT 
ATTATGGGTGAGCACGTGACAGTTCTTTTGCTACGTGGGATGAAACAAAAATAGCAATGAAGCCTTTGTCAGAATA 
TCTTGCTGAAGGCAAAATAACTGAGTTGGAGCTTGATGAAATTCATAAAAAGGTTGTGAATGCTGCTTATGAAGTT 
ATTAAGTTAAAGGGGGCAACCTATTATGCTATTGGACTTGGTATTAAGAATATTGTAAATGCAATAATTGGAGATC 
AGAATGTTATTCTGCCAATATCTTCTTATATTAATGGCCAGTATGGGGGATTGATTAAAGATATTTATATTGGAGC 
GCCTGCTATAGTTTGTAAGGAAGGAGTCAAAGAAGTTTTAAACTTTAAGATAAGCCCTAAAGAGCTTGATAAGTTT 
AATAGTTCTGCTAATCAGCTTAAAAGCTATATTGATAAAATGGAATTTTAG 

f 886 .aa 

MKKKQLILLLFMPQIIYAKSYFASDVFFNKYQKLNEKPKTGFYIEYYSVDDTEKLYLYKENNLIKYKTIQIIENTK 

KITCYOTKOTKRKEEIYDNLNNKIQEIEYDSKGKTLETAimArENENLISKNLKTIN 

TGSNFQIV\m^X'GINGDIKSTYFDIKKATTKVIKYDDKKRNSNSTIIV^^SrKIKSKEra^QYLDEEKI 

STYKANNLIKEETYKNNELIKVNDFQYNESDMIIFQNTKEKDKDQYTNTKIEYEYNK1)NQLKSKKIYENDII 

EYHNDNEYEEEIYYNKKPALRVKHKNGKVTEEKPIGTN 

t886 .aa 

SYFASDVFFNKYQKLNEKPKTGFYIEYYSVDDTEKLYLYKEKnSTLIKYKTIQIIEOTKKITCYDTKOT 
IJ^NKIQEIEYDSKGKTLETANYWENENLISKNLKTINQKPKLIYYSKDDNGKLLKITGSNFQ^ 
YFDIKKATTKVIKYDDKKRNSNSTI IVNNKI KSKEKNQYLDEEKI VNTFEEENTKI I STYKA]^ 
IKVNDFQYNESDMIIFQNTKEKDKDQYTNTKIEYEYNKDNQLKSKKIYENDIIYLKTEYHNDNEYEEEIYYI^ 
LRVKHKNGKVTEEKPIGTN 
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f886.nt 

atgaaaaaaaaacaattaatacttcttctatttatgccacaaattatttatgcaaaaagctattttgcatctgatg 

tatttttcaataaataccaaaaattaaatgaaaaaccaaaaacggggttttatattgagtattattctgttg^^ 

tactgaaaaactctacctatacaaagaaaataacttaataaaatacaaaacaattcaaatcatagaaaacacaaaa 

aaaattacatgttatgatacaaaagatacaaaaagaaaagaagagatttacgataatttaaataacaaaatacaag 

aaattgaatatgatagcaaaggaaaaactcttgaaacagcaaattacgtttatgaaaacgaaaacttaatatctaa 

aaatttaaaaacaataaaccaaaaaccaaaattaatatattattctaaagacgacaatggtaaattactaaaaata 

acaggatcaaatttccaaatttggaactatggaattaatggcgacataaaatctacatattttgacatca;^^ 

caacaacaaaagttataaaatatgatgataaaaaaagaaattcaaacagtacaataattgttaataataaaataaa 

ATCCAAAGAAAAAAACCAATATTTAGATGAAGA/^AAAATAGTAAATACCTTTGAAGAAGAGAATACAAAAATCATA 

tctacctacaaggcaaacaacctaattaaagaagaaacatataaaaataatgaacttataaaagtaaatgattttc 
aatacaacgaatctgatatgataatttttcaaaacactaaagaaaaggataaagaccaatacaccaatactaaaat 
tgaatacgaatataacaaagacaatcaattaaaaagcaaaaaaatttatgagaacgatataatttatctaaaaact 
gaataccacaatgacaatgaatatgaagaagaaatatactacaataaaaaacctgctcttagggtaaaacacaaga 
acggaaaagtcaccgaagaaaaaccaataggaacaaattaa 

t886.nt 

AGCTATTTTGCATCTGATGTATTTTTCAATAAATACCAAAAATTAAATGAAAAACCAAAAACGGGGTTTTATATTG 
AGTATTATTCTGTTGATGATACTGAAAAACTCTACCTATACAAAGAAAATAACTTAATAAAATACAAAACAATTCA 
AATCATAGAAAACACAAAAAAAATTACATGTTATGATACAAAAGATACAAAAAGAAAAGAAGAGATTTACGATAAT 
TTAAATAACAAAATACAAGAAATTGAATATGATAGCAAAGGAAAAACTCTTGAAACAGCAAATTACGTTTATGAAA 
ACGAAAACTTAATATCTAAAAATTTAAAAACAATAAACCAAAAACCAAAATTAATATATTATTCTAAAGACGACAA 
TGGTAAATTACTAAAAATAACAGGATCAAATTTCCAAATTTGGAACTATGGAATTAATGGCGACATAAAATCTACA 
TATTTTGACATCAAAAAAGCAACAACAAAAGTTATAAAATATGATGATAAAAAAAGAAATTCAAACAGTACAATAA 
TTGTTAATAATAAAATAAAATCCAAAGAAAAAAACCAATATTTAGATGAAGAAAAAATAGTAAATACCTTTGAAGA 
AGAGAATACAAAAATCATATCTACCTACAAGGCAAACAACCTAATTAAAGAAGAAACATATAAAAATAATGAACTT 
ATAAAAGTAAATGATTTTCAATACAACGAATCTGATATGATAATTTTTCAAAACACTAAAGAAAAGGATAAAGACC 
AATACACCAATACTAAAATTGAATACGAATATAACAAAGACAATCAATTAAAAAGCAAAAAAATTTATGAGAACGA 
TATAATTTATCTAAAAACTGAATACCACAATGACAATGAATATGAAGAAGAAATATACTACAATAAAAAACCTGCT 
CTTAGGGTAAAACACAAGAACGGAAAAGTCACCGAAGAAAAACCAATAGGAACAAATTAA 

f 888.aa 

MEKLKLKLAIPLLVFTICKIHSQSNIEYNFSYIINTKKENIDLKKGIEKQLDKIYDKITEHIVNNDDKSIIEDIYI 

NQDIIKTELEISKLKKEMDKKKLQNIITAKEKHNTKTKIDELKKNIQNIl^KQKKFAEYFNNLKKLKVKYKKIEEQ 

TNISNLNKEFFIREELFFINYIDLKKIENYYLLEISNITPEKIETKKAVFKTSSSVNEIADHITKYSLKEILGREF 

LKININVKNNSDAKIYINEKFVSKGIYHDNIFDISKLPNKEIEIQITSANFENYSIKRT\nCNADSIILD 

SKKVSIKSNVQSKVFKKGIFMGETPIEIEKPENQDIILLKSKGYKDKFKLINKEEDQVEIEMIKTNKNRLIDTRDK 

FYVNIAWTLSTIGAIFAGTLLNNSEVLYKITGNHFINKRLTAEDVYMAKAEQMTATFLFGVGI 

HLVEYIKEANMGE 

t888.aa 

SNIEYNFSYIINTKKENIDLKKGIEKQLDKIYDKITEHIVNNDDKSIIEDIYINQDIIKTELEISKLKKEMDKKKL 

QNIITAKEKHNTKTKIDELKKNIQNirmKQKKFAEYFNNLKKLKWYKKIEEQTNISNLN^ 

LKKIENYYLLEISNITPEKIETKKAVFKTSSSVNEIADHITKYSLKEILGREFLKINim^KNNSDA^ 

KGIYHDNIFDISKLPNKEIEIQITSANFENYSIKRTVKNADSIILDIDLKRTISKKVSIKSNVQSKVFKKGIFMGE 

TPIEIEKPENQDIILLKSKGYKDKFKLINKEEDQVEIEMIKTNKNRLIiyrRDKFYWIAVFTLSTIGAIFAGTLLN 

NSEVLYKITGNHFI^fKRLTAEDVYMAKAEQMTATFLFGVGITLTIGSFISLITHLVEYIKEANMGE 
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ATGGA?^AAGCTTAAACTAAAGCTAGCAATACCATTGCTAGTATTTACAATATGCAAAATACATTCTCAAAGTA^ 

TTGAATACAATTTTTCCTATATCATTAATACAAAAAAAGAAAATATTGACCTAAAAAAGGGTATTGAAAAACAATT 

GGACAAAATCTATGATAAAATAACAGAACATATAGTAAACAATGATGACAAGAGCATCATTGAAGACATTTATATA 

AATCAAGATATAATAAAAACAGAACTTGAAATTAGCAAATTAAAAAAAGAAATGGATAAAAAAAAAC 

TAATAACCGCAAAAGAAAAGCATAACACCAAAACCAAAATTGATGAGCTTAAAAAAAATATTCAAAATATTAACAA 

TAAACAAAAAAAATTTGCAGAATATTTTAACAATTTAAAAAAACTAAAAGTAAAATATAAAAAAATCGAAGAGCAA 

ACAAATATATCAAATTTAAATAAAGAATTTTTTATAAGAGAAGAATTATTTTTTATTAACTATATTGATCTTAAAA 

AAATAGAAAATTATTATTTGCTAGAAATTAGCAACATCACTCCTGAGAAAATTGAGACTAAAAAAGCGGTATTTAA 

AACATCATCTTCTGTTAATGAAATTGCAGATCACATAACAAAATACAGCCTCAAAGAAATATTGGGCAGAGAATTT 

TTAAAAATCAACATTAACGTCAAAAATAACTCGGATGCAAAAATCTACATAAATGAAAAATTTGTTTCAAAAGGAA 

TCTATCACGATAATATTTTTGACATTTCTAAACTCCCAAACAAAGAAATTGAAATACAAATCACAAGTGCAAATTT 

CGAAAACTATTCTATTAAAAGAACGGTAAAAAATGCAGACTCAATAATATTAGATATTGACTTAAAAAGAACAATC 

TCTAAAAAAGTATCAATTAAAAGCAATGTACAATCTAAAGTTTTTAAAAAAGGAATATTTATGGGAGAAACCCCAA 

TTGAAATTGAAAAACCAGAAAATCAAGATATCATCTTGCTTAAATCTAAAGGATATAAAGATAAATTCAAGTTAAT 

AAATAAAGAAGAAGATCAAGTAGAAATAGAAATGATAAAAACTAACAAAAATAGACTTATCGACACAAGAGATAAA 

TTTTATGTCAATCTGGCCGTCTTTACATTAAGCACAATAGGAGCCATTTTTGCAGGAACATTGCTTAACAATTCAG 

AAGTACTTTATAAAATAACAGGCAATCACTTTATTAACAAAAGATTAACAGCAGAAGATGTTTATATGGCAAAAGC 

GGAACAAATGACTGCAACATTTCTATTTGGAGTAGGAATCACTTTAACTATTGGAAGCTTTATCTCATTAATAACT 

CATTTAGTAGAATATATTAAAGAAGCAAATATGGGAGAATAG 

t888 .nt 

AGTAATATTGAATACAATTTTTCCTATATCATTAATACAAAAAAAGAAAATATTGACCTAAAAAAGGGTATTGAAA 

AACAATTGGACAAAATCTATGATAAAATAACAGAACATATAGTAAACAATGATGACAAGAGCATCATTGAAGACAT 

TTATATAAATCAAGATATAATAAAAACAGAACTTGAAATTAGCAAATTAAAAAAAGAAATGGATAAAAAAAAACTT 

CAAAACATAATAACCGCAAAAGAAAAGCATAACACCAAAACCAAAATTGATGAGCTTAAAAAAAATATTCAAAATA 

TTAACAATAAACAAAAAAAATTTGCAGAATATTTTAACAATTTAAAAAAACTAAAAGTAAAATATAAAAAA^ 

AGAGCAAACAAATATATCAAATTTAAATAAAGAATTTTTTATAAGAGAAGAATTATTTTTTATTAACTATATTGAT 

CTTAAAAAAATAGAAAATTATTATTTGCTAGAAATTAGCAACATCACTCCTGAGAAAATTGAGACTAAAAAAGCGG 

TATTTAAAACATCATCTTCTGTTAATGAAATTGCAGATCACATAACAAAATACAGCCTCAAAGAAATATTGGGCAG 

AGAATTTTTAAAAATCAACATTAACGTCAAAAATAACTCGGATGCAAAAATCTACATAAATGAAAAATTTGTTTCA 

AAAGGAATCTATCACGATAATATTTTTGACATTTCTAAACTCCCAAACAAAGAAATTGAAATACAAATCACAAGTG 

CAAATTTCGAAAACTATTCTATTAAAAGAACGGTAAAAAATGCAGACTCAATAATATTAGATATTGACTTAAAAAG 

AACAATCTCTAAAAAAGTATCAATTAAAAGCAATGTACAATCTAAAGTTTTTAAAAAAGGAATATTTATGGGAGAA 

ACCCCAATTGAAATTGAAAAACCAGAAAATCAAGATATCATCTTGCTTAAATCTAAAGGATATAAAGATAAATTCA 

AGTTAATAAATAAAGAAGAAGATCAAGTAGAAATAGAAATGATAAAAACTAACAAAAATAGACTTATCGACACAAG 

AGATAAATTTTATGTCAATCTGGCCGTCTTTACATTAAGCACAATAGGAGCCATTTTTGCAGGAACATTGCTTAAC 

AATTCAGAAGTACTTTATAAAATAACAGGCAATCACTTTATTAACAAAAGATTAACAGCAGAAGATGTTTATATGG 

CAAAAGCGGAACAAATGACTGCAACATTTCTATTTGGAGTAGGAATCACTTTAACTATTGGAAGCTTTATCTCATT 

AATAACTCATTTAGTAGAATATATTAAAGAAGCAAATATGGGAGAATAG 

f893.aa 

MVRFLGFLYLITTIPLIKSCDAAQFGDYKPLYFENENDLKTANEYINSLGYKTISEYTTKIDILDFPENKEITINE 
INKLNNLDLRKSIFLKKLSNLFNIEHKKLLYVENRFKSINFKNLKKELNINADIHSLDYKTKINFISSIIFLIIII 
LLIFLDPTNSIFTLIFLLISSLAFMISKEIMYFYPFTVLSYLLFLIISNFNKNYNKIYLKEINFLTIJ4TKIKHLLF 
LFTFTALYFITITTFFTTNIDPTFIAFVAIPTLCIFLIFSWIKTESNFKDTFLFPIEIKEKKIEGKKALKSKIAIH 
LLLFTLSLIPFAYSSYMIJSlSYENINYLYSKKLKrYFDYLNPNNIYIMLGYNKD 

YGKIPKDIKENYFEIKNDKIEIHPKTVYEVDKSFIDEILKKDI^SLFLKNKNPILIYKENKl^INTDK 

FSLPFFVLLFLFKAIRFTILLNIN 

EKTYKKYIQG 

t893 ,aa 

CDAAQFGDYKPLYFENENDLKTANEYINSIiGYKTISEYTTKIDILDFPENKEITINEINKLNNLDLRKSIF 
NLFNIEHKKLLYVENRFKSINFKNLKKELNINADIHSLDYKTKINFISSIIFLIIIILLIFLDPTO 
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SSIAFMISKEIMYFYPFTVLSYLLFLIISNFNKlTmKIYLKEINFLTI^ 

IDPTFIAFVAIPTLCIFLIFSWIKTESNFKDTFLFPIEIKEKKIEGKKALKSKIAIHLLLFTLSLIPFAYSSYMLN 
SYENINYLYSKiCLNYFDYLNPmiYIMIXSYNKDMPNIIGYLSHILYQNELK 

lEIHPKTVYEVDKSFIDEILKKDIiASLFLKNKNPILIYKENKNNINTDKKNYKILFFFSLPFFVLLFLFK^ 
LLNINEKTYKKYIQG 

f893.nt 

ATGGTGCGTTTTTTAGGTTTTTTATATTTAATTACAACAATACCACTTATCAAATCCTGTGATGCAGCTCAATTTG 
GAGACTACAAACCTTTATACTTTGAAAATGAAAATGATCTAAAAACTGCCAATGAATATATAAATTCACTAGGATA 
CAAAACAATCTCAGAATACACAACAAAAATTGACATTTTAGACTTTCCCGAAAATAAAGAAATCACAATAAATGAG 
ATAAACAAACTT7VACAATCTTGACCTGAGAAAAAGCATATTTTTAAAAAAGCTCTCCAATCTTTTCAACATAGAGC 
ACAAAAAACTTCTTTATGTTGAAAACAGGTTTAAAAGTATAAATTTTAAAAACCTAAAAAAAGAACTCAATATTAA 
TGCCGACATACATTCTCTTGACTACAAAACAAAAATTAATTTTATTTCAAGCATAATATTTCTAATCATAATAATT 
TTATTAATTTTTTTAGACCCAACAAACTCTATATTTACTTTAATTTTTCTATTAATTTCATCTCTTGCTTTTATGA 
TAAGCAAAGAAATAATGTATTTTTATCCATTTACAGTTCTCTCTTATTTGTTATTTTTAATAATCAGTAATTTTAA 
CAAAAATTACAATAAAATATATTTAAAAGAAATAAATTTTTTAACACTAATGACAAAAATAAAACACTTACTATTT 
TTATTTACATTCACAGCTCTATATTTCATTACAATCACAACCTTTTTTACTACAAATATTGATCCCACTTTTATTG 
CATTTGTCGCAATACCAACCCTTTGCATTTTCTTAATTTTCAGCTGGATAAAAACAGAAAGCAATTTTAAAGACAC 
TTTCTTATTCCCAATCGAGATTAAAGAGAAAAAAATAGAAGGAAAAAAAGCTTTAAAATCAAAAATAGCAATACAT 
CTACTACTATTTACACTCTCATTAATTCCTTTCGCTTATTCAAGCTATATGCTAAATTCTTATGAAAACATTAACT 
ACCTTTACAGTAAAAAATTAAATTACTTTGATTATTTAAATCCTAATAACATTTATATAATGCTGGGATACAACAA 
AGACATGCCCAATATTATAGGGTACCTATCCCACATTCTTTATCAAAACGAACTAAAATACAATATTACCGCTAAG 
TATGGAAAAATTCCTAAAGATATAAAAGAAAATTACTTTGAAATCAAAAACGACAAAATAGAAATTCATCCTAAAA 
CTGTTTACGAAGTAGACAAATCATTTATTGATGT^AATTCTTAAAAAAGATCTTGCAAGTCTGTTTTTAAAAAATAA 
AAATCCAATCCTAATATATAAAGAAAACAAGAATAATATCAACACAGATAAAAAAAATTACAAAATACTTTTCTTT 
TTCTCTTTGCCCTTCTTTGTATTACTATTCCTATTTAAAGCAATAAGATTTACAATTCTTTTAAACATAAATGAAA 
AAACCTATAAAAAATATATTCAAGGATAA 

t893.nt 

tgtgatgcagctcaatttggagactacaaacctttatactttgaaaatgaaaatgatctaaaaactgccaatgaat 
atataaattcactaggatacaaaacaatctcagaatacacaacaaaaattgacattttagactttcccgaaaataa 
agaaatcacaataaatgagataaacaaacttaacaatcttgacctgagaaaaagcatatttttaaaaaagctctcc 
aatcttttcaacatagagcacaaaaaacttctttatgttgaaaacaggtttaaaagtataaattttaaaaacctaa 
aaaaagaactcaatattaatgccgacatacattctcttgactacaaaacaaaaattaattttatttcaagcataat 
atttctaatcataataattttattaatttttttagacccaacaaactctatatttactttaatttttctattaatt 
tcatctcttgcttttatgataagcaaagaaataatgtatttttatccatttacagttctctcttatttgttatttt 
taataatcagtaat^taacaaaaattacaataaaatatatttaaaagaaataaattttttaacactaatgaca;^ 
aataaaacacttactatttttatttacattcacagctctatatttcattacaatcacaaccttttttactacaaat 

ATTGATCCCACTTTTATTGCATTTGTCGCAATACCAACCCTTTGCATTTTCTTAATTTTCAGCTGGATAAA/^ 

aaagcaattttaaagacactttcttattcccaatcgagattaaagagaaaaaaatagaaggaaaaaaagctttaaa 

atcaaaaatagcaatacatctactactatttacactctcattaattcctttcgcttattcaagctatatgctaaat 

tcttatgaaaacattaactacctttacagtaaaaaattaaattactttgattatttaaatcctaataacatttata 

taatgctgggatacaacaaagacatgcccaatattatagggtacctatcccacattctttatcaaaacgaactaaa 

atacaatattaccgctaagtatggaaaaattcctaaagatataaaagaaaattactttgaaatcaaaaacgacaaa 

atagaaattcatcctaaaactgtttacgaagtagacaaatcatttattgatgaaattcttaaaaaagatct^ 

gtctgtttttaaaaaataaaaatccaatcctaatatataaagaaaacaagaataatatcaacacagataaaaaaaa 

ttacaaaatacttttctttttctctttgcccttctttgtattactattcctatttaaagcaataagatttacaat^ 

cttttaaacataaatgaaaaaacctataaaaaatatattcaaggataa 

f895.aa 

MIRALLTNDLFLSCLVSGISAQVIKYGIQTVKTRKLKLTPVHLLKKIFLETGGMPSSHSSTVTALSTSIALTEGID 

TNFIIAIAFALITIRDSFGVRYMSGVQAEYU^ALSEKLKiCEIKIDTTKIKWKGHKKKEV^ 

F 
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t895 .aa 

AQVIKYGIQTVKTRKLKLTPVHLLKKIFLETGGMPSSHSSTVTALSTSIALTEGIDTNFIIALAFALITIRDSFGV 
RYMSGVQAEYLNALSEKLKKEIKIDTTKI KWKGHKKKEVLTGI I IGIVSAYIVCYF 

£895. nt 

ATGATAAGGGCATTGCTTACCAATGATCTTTTTTTGTCTTGTCTTGTATCAGGAATTTCTGCTCAAGTGATTAAAT 
ATGGTATCCAAACTGTAAAAACAAGAAAGTTAAAACTAACTCCAGTACATCTTTTAAAAAAAATTTTTCTAGAAAC 
AGGAGGCATGCCAAGTAGTCATTCATCAACGGTCACCGCTCTTTCAACCTCAATCGCACTAACTGAAGGAATAGAT 
ACAAATTTTATAATAGCTCTTGCATTTGCCCTTATTACAATAAGAGATTCTTTCGGCGTAAGATATATGTCTGGAG 
TTCAAGCAGAATATTTAAATGCATTATCAGAAAAATTAAAAAAAGAAATAAAAATTGACACAACAAAAATAAAAGT 
GGTCAAGGGGCACAAAAAGAAAGAGGTTCTAACGGGCATAATAATAGGAATAGTCTCTGCGTATATTGTGTGCTAT 
TTTTAG 

t895.nt 

GCTCAAGTGATTAAATATGGTATCCAAACTGTAAAAACAAGAAAGTTAAAACTAACTCCAGTACATCTTTTAAAAA 
AAATTTTTCTAGAAACAGGAGGCATGCCAAGTAGTCATTCATCAACGGTCACCGCTCTTTCAACCTCAATCGCACT 
AACTGAAGGAATAGATACAAATTTTATAATAGCTCTTGCATTTGCCCTTATTACAATAAGAGATTCTTTCGGCGTA 
AGATATATGTCTGGAGTTCAAGCAGAATATTTAAATGCATTATCAGAAAAATTAAAAAAAGAAATAAAAATTGACA 
CAACAAAAATAAAAGTGGTCAAGGGGCACAAAAAGAAAGAGGTTCTAACGGGCATAATAATAGGAATAGTCTCTGC 
GTATATTGTGTGCTATTTTTAG 

f605.aa 

lyryiGAAGKSFSIIIDSAFLSNCFLFIGSFSRSDSIJyiSLSNSRFEYPYDASCEFSLVNIVKYVCGSKYSPMRPTLII 
SKLPVFLLL VRTGQFS LVS I RL I FRI FFHWF Z 

t605.aa 

CFLFIGSFSRSDSLMSLSNSRFEYPYDASCEFSLVNIVKYVCGSKYSPMRPTLIISKLPVFLLLVRTGQFSLVSIR 
LIFRIFFHWFZ 

f 605.nt 

ATGTATATTGGTGCAGCAGGAAAATCTTTTTCAATTATTATTGATTCTGCTTTTCTGAGTAATTGTTTTCTTTTTA 

TAGGATCTTTTTCAAGATCTGATTCTCTGATGAGTTTGTCAAATTCTAGGTTTGAATATCCGTATGATGCAAGTTG 

TGAATTTTCTCTTGTGAATATAGTAAAGTATGTGTGTGGATCTAAATATTCCCCAATGCGTCCAACTCTTATTATT 

TCAAAATTGCCAGTATTTCTGCTGTTGGTAAGAACAGGCCAATTTTCGTTGGTAAGCATAAGATTGATA^ 

TTTTTTTCCATTGGTTTTGA 

t605 .nt 

TGTTTTCTTTTTATAGGATCTTTTTCAAGATCTGATTCTCTGATGAGTTTGTCAAATTCTAGGTTTGAATATCCGT 
ATGATGCAAGTTGTGAATTTTCTCTTGTGAATATAGTAAAGTATGTGTGTGGATCTAAATATTCCCCAAT^ 
AACTCTTATTATTTCAAAATTGCCAGTATTTCTGCTGTTGGTAAGAACAGGCCAATTTTCGTTGGTAAGCATAAGA 
TTGATATTTAGAATTTTTTTCCATTGGTTTTGA 

f 606.aa 

MKLQRSLFLIIFFLTFLCCNNKERKEGVSFKISLGAEPSSLDPQIAEDm/ASKMIDTMFRGI\miDPNTGGN^ 
AKGWDISSDGTVYTFJSOiREKITWSIXSVAITAEGIRKSYLRIIJ^KETGSKYVEl^^ 

RAIDEKTLEITLESPKPYFIDMLVHQSFIPVPVHVTEKYGQNWTSPENMVTSGPFKLKERIPNEKYVFEKN^ 
SNEVE LEE ITFYTTNDS STAYKMYENE ELDAI FGS I PPDLI KNLKLRSDYYS S AVNAI YF YAFNTH I ^ 
KALTIxAIDRETLTYKVLDNGTTPTRRATPNFSSYSYAKSLELFNPEIAKTLLAEAGYPNGNGFPILKIiKYNTl^^ 
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KKICEFIQNQWKKNIJSriDVELENEEWTTYLNTKANGNYEIARAGWIGDYADPLTFLSIFTQGYTQFSSH^ 
NELIKKSDLELDPIKRQDILRQAEEIIIEKDFPIAPIYIYGNSYLFRNDKWTGWNTNILERFDLSQLKLKNKZ 

t606 .aa 

CCNNKERKEGVSFKISLGAEPSSLDPQLAEDNVASKMIDTMFRGIVTGDPNTGGNKPGLAKGWDISSTC 
REKITWSrXBVAITAEGIRKSYLRILNKETGSKYVEMVKSVIKNGQKYFIX^QVTDSELXSIR^ 
YFIDMLVHQSFI PVPVHVTEKYGQIWTSPENMVTSGPFKLKERI PNEKYWEKNNKYYDSN^ 

STAYKMYENEELDAIFGSIPPDLIKNLKLRSDYYSSAVNAIYFYAFNTHIKPLD^^VKIRKALTIAIDRETLTYKVL 
DNGTTPTRRATPNFSSYSYAKSLELFNPEIAKTLLAEAGYPNGNGFPILKLKYNTNEANKKICEFIQNQWKKNLNI 
DVELENEEWTTYIJSrTKANGNYEIARAGWIGDYADPLTFLSIFTQGYTQFSSHNYSNPEYNELIKKSDLELDPI^^ 
DILRQAEEIIIEKDFPIAPIYIYGNSYLFRNDKWTGWNTNILERFDLSQLKLKNKZ 

f606.nt 

ATGAAATTACAAAGGTCATTATTTTTAATAATATTTTTTCTAACTTTTCTTTGTTGTAATAACAAGGAAAGAAAAG 
AAGGAGTATCATTTAAAATAAGCTTGGGAGCAGAGCCAAGCAGTCTTGACCCTCAATTAGCAGAGGATAATGTCGC 
ATCAAAAATGATTGACACAATGTTTAGAGGGATTGTTACAGGAGATCCTAATACAGGGGGAAATAAACCGGGACTT 
GCAAAAGGGTGGGATATTTCTTCTGATGGAACAGTTTACACATTTAACCTAAGAGAAAAAATCACTTGGAGTGACG 
GAGTTGCAATCACTGCAGAAGGAATTAGAAAATCTTATCTTAGAATTTTAAATAAAGAAACTGGCTCTLAAGTACGT 
TGAAATGGTTAAATCGGTAATTAAAAATGGTCAAAAATATTTTGATGGACAAGTGACTGACTCTGAACTTGGAATT 
AGAGCGATTGATGAAAAAACATTAGAAATAACACTGGAATCACCAAAACCTTATTTTATTGATATGTTAGTACACC 
T^TCATTTATTCCAGTACCAGTTCATGTTACCGAAAAGTATGGACAAAACTGGACAAGCCCCGAAAACATGGTGAC 
AAGTGGTCCTTTTAAATTAAAAGAAAGAATTCCTAACGAAAAATATGTCTTTGAAAAAAATAACAAATACTACGAC 
TCAAATGAAGTAGAATTAGAAGAGATTACATTTTACACAACAAATGACAGCTCAACAGCGTATAAAATGTATGAAA 
ATGAAGAGCTAGATGCAATTTTTGGTTCCATACCCCCAGATCTAATCAAAAATCTAAAATTAAGAAGCGACTATTA 
CTCATCAGCTGTTAATGCCATATACTTTTACGCGTTCAATACACACATCAAACCACTTGACAACGTTAAAATTAGA 
AAAGCCTTAACTCTTGCTATTGACAGAGAAACGCTTACATATAAAGTTCTTGACAACGGGACTACCCCTACAAGAA 
GAGCAACTCCCAACTTTAGTTCATATTCTTATGCAAAAAGTTTAGAATTATTTAATCCTGAAATTGCAAAAACCCT 
TCTAGCTGAAGCTGGATATCCTAATGGCAATGGATTTCCAATTTTAAAATTAAAATACAATACAAACGAAGCAAAT 
AAAAAAATTTGTGAATTTATTCAAAACCAATGGAAAAAAAATTTAAATATTGATGTGGAACTTGAAAACGAAGAAT 
GGACAACATACTTAAACACTAAGGCAAATGGAAATTATGAAATAGCAAGAGCAGGATGGATAGGCGATTATGCTGA 
TCCTTTGACATTTTTAAGCATATTCACACAAGGATACACACAATTCTCATCTCATAATTACTCAAACCCAGAATAC 
AACGAACTTATAAAGAAATCCGACCTTGAGCTTGATCCAATAAAAAGACAAGACATTTTAAGACAAGCAGAAGAGA 
TAATTATTGAAAAAGATTTTCCAATAGCACCAATATACATATATGGGAACAGTTACCTTTTCAGAAATGACAAATG 
GACAGGGTGGAACACCAATATTTTAGAAAGATTTGATTTATCTCAGCTAAAATTAAAAAATAAATAA 

t606.nt 

TGTTGTAATAACAAGGAAAGAAAAGAAGGAGTATCATTTAAAATAAGCTTGGGAGCAGAGCCAAGCAGTCTTGACC 

CTCAATTAGCAGAGGATAATGTCGCATCAAAAATGATTGACACAATGTTTAGAGGGATTGTTACAGGAGATCCTAA 

TACAGGGGGAAATAAACCGGGACTTGCAAAAGGGTGGGATATTTCTTCTGATGGAACAGTTTACACATTTAACCTA 

AGAGAAAAAATCACTTGGAGTGACGGAGTTGCAATCACTGCAGAAGGAATTAGAAAATCTTATCTTAGAATTTTAA 

ATAAAGAAACTGGCTCAAAGTACGTTGAAATGGTTAAATCGGTAATTAAAAATGGTCAAAAATATTTTGATGGACA 

AGTGACTGACTCTGAACTTGGAATTAGAGCGATTGATGAAAAAACATTAGAAATAACACTGGAATCACCAAAACCT 

TATTTTATTGATATGTTAGTACACCAATCATTTATTCCAGTACCAGTTCATGTTACCGAAAAGTATGGACAAAACT 

GGACAAGCCCCGAAAACATGGTGACAAGTGGTCCTTTTAAATTAAAAGAAAGAATTCCTAACGAAAAATAT^ 

TGAAAAAAATAACAAATACTACGACTCAAATGAAGTAGAATTAGAAGAGATTACATTTTACACAACAAATGACAGC 

TCAACAGCGTATAAAATGTATGAAAATGAAGAGCTAGATGCAATTTTTGGTTCCATACCCCCAGATCTAATCAAAA 

ATCTAAAATTAAGAAGCGACTATTACTCATCAGCTGTTAATGCCATATACTTTTACGCGTTCAATACACACATCAA 

ACCACTTGACAACGTTAAAATTAGAAAAGCCTTAACTCTTGCTATTGACAGAGAAACGCTTACATATAAAGTTCTT 

GACAACGGGACTACCCCTACAAGAAGAGCAACTCCCAACTTTAGTTCATATTCTTATGCAAAAAGTTTAGAATT^^ 

TTAATCCTGAAATTGCAAAAACCCTTCTAGCTGAAGCTGGATATCCTAATGGCAATGGATTTCCAAT^ 

AAAATACAATACAAACGAAGCAAATAAAAAAATTTGTGAATTTATTCAAAACCAATGGAAAAAAAAT^ 

GATGTGGAACTTGAAAACGAAGAATGGACAACATACTTAAACACTAAGGCAAAT^ 

CAGGATGGATAGGCGATTATGCTGATCCTTTGACATTTTTAAGCATATTCACACAAGGATAC^ 
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TCATAATTACTCAAACCCAGAATACAACGAACTTATAAAGAAATCCGACCTTGAGCTTGATCCAATAAAAAGACAA 
GACATTTTAAGACAAGCAGAAGAGATAATTATTGAAAAAGATTTTCCAATAGCACCAATATACATATATGGGA^ 
GTTACCTTTTCAGAAATGACAAATGGACAGGGTGGAACACCAATATTTTAGAAAGATTTGATTTATCTCAGCTAAA 
ATTAAAAAATAAATAA 

f 679.aa 

MFNRSSCVLQNFLFLFLFLSLVSCFAKKEISGNNFIKAHSKEFDLNNLNWLWNFDYTKKNF 

YLFKKIGFEEKFVEYMKKAIANGDSIASQFAGIKLIEYFNSAKEYFASELIGEKLYKKYENNKFIILGYFKSLYWQ 
KKITOKALSLLNKLDKI^FSDYQENENILLKAVLYI^SNVSESKIYFNELFENLPANY 

GANFLNLVRFKYEVANGNFNGAINILNKNGLNDYYDIONI VLSDVYKAF I S SGKVSNALTFFSKI KSKYKIT^ 

NLREKNNLGLLLLKEYLEGLDLNNEINRLDLLNTAFSNLIFTKSARDYFAESLPKFYTEGDKKNSTFIKILEEYIL 

ESIQLEDYGNLYKLYSNAQKVISNSVLSKLAFINARLIYHKLIKPNVSGEYKSLLHSAVNYDKWSYSSFMSRYLLD 

QNIDEFFTGGSDIKYEQSDYEIFLEGFLKFNLCNYVRGFISEDFRNGYKFSLDFYRKVYDELLKSENYYDATLVIN 

YLWQDESALMENDYKRLYPYLYGSLIEYWAKRRGLEASVVFSLIKAESSFEKNAVSKPGAVGLMQVMPSTAND^ 

KELKYFKTf DLKI PKDNI I IGTYYLKKRI STTGSLYKAIASYNGGIGNVRKWEKSYGHLSKELFIEAI PFSQTRN^ 

KKILVYSVFYDALYEKKGIDSVIVKIMGEFPKNZ 

t679.aa 

CFAKKEISGNNFIKAHSKEFDLNNLNWLWNFDYTKKNFDKHFNIDPSSYIYVAYLFKKIGFEEKFVEY^ 
DSIASQFAGIKLIEYFNSAKEYFASELIGEKLYKKYENNKFIILGYFKSLYWQKKNDKALSLLNKLDKMKFSDYQE 
NENILLKAVLYLNLSNVSESKIYFNELFENLPANYLHVRAYDYFIIENKSRYFGANFLNLVRFKYEVANGNFNGAI 
NILNKNGLNDYYDNNIVLSDWKAFISSGKVSNALTFFSKIKSKYKNYYLGIDNLREK^ 

NEINRLDLLNTAFSNLIFTKSARDYFAESLPKFYTEGDKKNSTFIKILEEYILESIQLEDYGNLYKLYSNAQKVIS 

NSVLSKLAFINARLIYHKLIKPNVSGEYKSLLHSAVNYDKWSYSSFMSRYLLDQNIDEFFTGGSDIKYEQSDYEIF 

LEGFLKFNLCNYVRGFISEDFRNGYKFSLDFYRKVYDELLKSENYYDATLVIlSrirLVNQDESALMEN^ 

GSLIEYWAKRRGLEASWFSLIKAESSFEKNAVSKPGAVGLMQVMPSTANDISKELKYFlSnfDLKIPKDNIIIGT^ 

LKKRISTTGSLYKALASYNGGIGNVRKWEKSYGHLSKELFIEAIPFSQTRNYIKKILVYSVFYDALYEKKGIDSVI 

VKIMGEFPKNZ 

f 679.iit 

ATGTTTAATAGAAGTTCTTGTGTATTACAAAATTTTCTTTTTCTTTTTTTATTTTTAAGTTTAGTTTCTTGCTTTG 

CAAAAAAAGAAATCTCAGGCAATAATTTTATTAAGGCGCATTCAAAAGAGTTTGATTTAAATAATTTAAATTGGTT 

ATGGAATTTTGATTATACAAAAAAAAATTTTGATAAGCATTTTAACATAGATCCAAGTTCTTAC^ 

TATTTATTTAAAAAAATAGGATTTGAAGAGAAATTTGTAGAGTATATGAAAAAGGCCATAGCTAATGGAGATAGCA 

TTGCATCCCAGTTTGCTGGGATTAAGCTTATTGAATATTTTAACTCAGCAAAAGAGTATTTTGCATCTGAATTGAT 

TGGAGAGAAGCTTTATAAAAAATACGAAAATAATAAATTTATTATACTGGGGTACTTTAAAAGTCTTTATTGGCA^ 

AAGAAAAACGATAAGGCACTTAGTCTTTTAAATAAGCTTGATAAGATGAAATTTTCTGATTATCAGGAAAATGAAA 

ATATTTTATTAAAAGCAGTTCTTTACCTTAATCTTTCTAATGTAAGTGAGTCAAAAATTTATTTTAATGAGCTTTT 

TGAGAACTTACCTGCAAATTATTTACATGTAAGAGCTTATGATTATTTTATTATTGAAAATAAGTCTAGGTA 

GGTGCAAATTTTTTAAATCTTGTTAGATTTAAGTATGAAGTGGCAAATGGCAATTTTAATGGTGCAATAAATATAT 

TAAATAAAAATGGTTTAAATGATTATTATGACAATAACATTGTATTAAGTGATGTTTATAAGGCTTTTATTAGTTC 

TGGCAAAGTTTCAAATGCTTTAACATTTTTTAGTAAAATAAAGAGCAAATATAAAAATTATTATTTAGGTATTCTA 

AACCTTAGAGAGAAAAATAATTTAGGACTTCTTCTTTTAAAAGAATATCTTGAAGGTTTAGATCTTAACAATGAGA 

TTAACAGGCTTGATTTGCTTAATACTGCTTTTAGCAATTTAATTTTTACTAAGAGCGCAAGGGATTATTTTGCCGA 

AAGTTTACCCAAGTTTTATACCGAGGGCGATAAAAAAAATTCTACTTTTATTAAGATTTTAGAAGAGTATATTTT 

GAATCAATTCAGCTTGAAGACTATGGCAATCTTTATAAGCTTTATTCTAATGCTCAAAAAGTTATTTCTAATTCTG 

TTTTGTCTAAGCTTGCTTTTATTAATGCAAGGCTTATATATCATAAATTAATTAAACCTAACGTAAGCGGAGAATA 

CAAGAGTCTTTTGCATTCTGCTGTTAATTATGATAAATGGTCTTATTCTTCATTTATGAGTAGGTACTTATT^ 

CAAAATATTGATGAATTTTTTACAGGTGGGTCTGATATTAAGTATGAGCAATCCGATTATGAGATTTTTTTGGAA 

GGTTTTTAAAATTCAATCTTTGTAATTATGTTAGAGGGTTTATTTCTGAGGATTT^ 

ACTTGATTTTTATCGAAAAGTATACGATGAACTTTTAAAGAGTGAAAATTATTACGATGCAACTCTTGTC 

TATCTTGTAAATCAAGATGAATCTGCTTTAATGGAGAATGACTATAAAAGACTTTATC 

TGATAGAATATTGGGCTAAAAGGAGAGGGCTTGAAGCTAGTGTTGTATTTTCTTTAATAAAAGCAGAGAG 
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TGAAAAAAATGCTGTCTCAAAACCGGGTGCTGTTGGCCTTATGCAGGTTATGCCATCAACAGCAAATGATATTTCT 
AAAGAACTTAAGTATTTTAACTATGATTTAAAGATTCCAAAAGATAATATAATAATTGGAACATATTATTTAAAAA 
AAAGAATATCTACAACTGGCAGTCTTTATAAGGCTCTTGCGTCTTATAATGGGGGTATTGGTAATGTTAGAAAGTG 
GGAGAAAAGTTATGGACATTTGTCAAAAGAGCTTTTTATTGAGGCAATTCCCTTTAGTCAAACTAGGAATTATATT 
AAAAAAATATTAGTTTATTCGGTATTTTATGATGCTTTGTATGAAAAGAAGGGAATAGATTCAGTAATAGTTAAAA 
TTATGGGCGAATTCCCCAAAAATTAA 

t679 .nt 



TGCTTTGCAAAAAAAGAAATCTCAGGCAATAATTTTATTAAGGCGCATTCAAAAGAGTTTGATTTAAATAATTTAA 
ATTGGTTATGGAATTTTGATTATACAAAAAAAAATTTTGATAAGCATTTTAACATAGATCCAAGTTCTTACATATA 
TGTTGCTTATTTATTTAAAAAAATAGGATTTGAAGAGAAATTTGTAGAGTATATGAAAAAGGCCATAGCTAATGGA 
GATAGCATTGCATCCCAGTTTGCTGGGATTAAGCTTATTGAATATTTTAACTCAGCAAAAGAGTATTTTGCATCTG 
AATTGATTGGAGAGAAGCTTTATAAAAAATACGAAAATAATAAATTTATTATACTGGGGTACTTTAAAAGTCTTTA 
TTGGCAAAAGAAAAACGATAAGGCACTTAGTCTTTTAAATAAGCTTGATAAGATGAAATTTTCTGATTATCAGGAA 
AATGAAAATATTTTATTAAAAGCAGTTCTTTACCTTAATCTTTCTAATGTAAGTGAGTCAAAAATTTATTTTAATG 
AGCTTTTTGAGAACTTACCTGCAAATTATTTACATGTAAGAGCTTATGATTATTTTATTATTGAAAATAAGTCTAG 
GTATTTTGGTGCAAATTTTTTAAATCTTGTTAGATTTAAGTATGAAGTGGCAAATGGCAATTTTAATGGTGCAATA 
AATATATTAAATAAAAATGGTTTAAATGATTATTATGACAATAACATTGTATTAAGTGATGTTTATAAGGCTTTTA 
TTAGTTCTGGCAAAGTTTCAAATGCTTTAACATTTTTTAGTAAAATAAAGAGCAAATATAAAAATTATTATTTAGG 
TATTCTAAACCTTAGAGAGAAAAATAATTTAGGACTTCTTCTTTTAAAAGAATATCTTGAAGGTTTAGATCTTAAC 
AATGAGATTAACAGGCTTGATTTGCTTAATACTGCTTTTAGCAATTTAATTTTTACTAAGAGCGCAAGGGATTATT 
TTGCCGAAAGTTTACCCAAGTTTTATACCGAGGGCGATAAAAAAAATTCTACTTTTATTAAGATTTTAGAAGAGTA 
TATTTTGGAATCAATTCAGCTTGAAGACTATGGCAATCTTTATAAGCTTTATTCTAATGCTCAAAAAGTTATTTCT 
AATTCTGTTTTGTCTAAGCTTGCTTTTATTAATGCAAGGCTTATATATCATAAATTAATTAAACCTAACGTAAGCG 
GAGAATACAAGAGTCTTTTGCATTCTGCTGTTAATTATGATAAATGGTCTTATTCTTCATTTATGAGTAGGTACTT 
ATTAGATCAAAATATTGATGAATTTTTTACAGGTGGGTCTGATATTAAGTATGAGCAATCCGATTATGAGATTTTT 
TTGGAAGGGTTTTTAAAATTCAATCTTTGTAATTATGTTAGAGGGTTTATTTCTGAGGATTTTAGGAATGGATATA 
AATTTTCACTTGATTTTTATCGAAAAGTATACGATGAACTTTTAAAGAGTGAAAATTATTACGATGCAACTCTTGT 
GATTAATTATCTTGTAAATCAAGATGAATCTGCTTTAATGGAGAATGACTATAAAAGACTTTATCCTTATTTGTAT 
GGATCTTTGATAGAATATTGGGCTAAAAGGAGAGGGCTTGAAGCTAGTGTTGTATTTTCTTTAATAAAAGCAGAGA 
GTAGCTTTGAAAAAAATGCTGTCTCAAAACCGGGTGCTGTTGGCCTTATGCAGGTTATGCCATCAACAGCAAATGA 
TATTTCTAAAGAACTTAAGTATTTTAACTATGATTTAAAGATTCCAAAAGATAATATAATAATTGGAACATATTAT 
TTAAAAAAAAGAATATCTACAACTGGCAGTCTTTATAAGGCTCTTGCGTCTTATAATGGGGGTATTGGTAATGTTA 
GAAAGTGGGAGAAAAGTTATGGACATTTGTCAAAAGAGCTTTTTATTGAGGCAATTCCCTTTAGTCAAACTAGGAA 
TTATATTAAAAAAATATTAGTTTATTCGGTATTTTATGATGCTTTGTATGAAAAGAAGGGAATAGATTCAGTAATA 
GTTAAAATTATGGGCGAATTCCCCAAAAATTAA 



fll-12.nt 

TAAAAGGAGA ATATTTTTAT 
TTGATGTCTT GTAATCTAGA 
AATGTAAAAG AAGTTTCGGA 
CAAGAAAAGC AAAAAAGCTT 
AATCCTATAG AGCCTATAAT 
ATTTCAATTG CGCATACTGA 
AATGAAGAAA AGGAAGCTGA 
TCTAAATTTT CTGAATTAAT 
AAAGCTGATT TGTATGATGT 
AATCCTAAGA ACAATAGAGA 
AAGATAGATA GTGAACTTGA 
AGCTCTGCGG CTTTCTTTTT 
AGATTAGAGA GTAAAAATAA 
GACGCAAGAA GTGCTTTAAG 
GTGAGAAAGG AAGAAATAAA 
AATAAAAAAT AA 



GAGAAAAAGT TTGTTTTTAT 
TTCCAAATTA TCTAGTAACA 
TAGTGTTCAA GAAGATGGTC 
TACTAAAAAT TTTGGAGAAC 
ACCTTCAGAA TCACCAAAGA 
AAAAAAAGAG ACAAAAAAGG 
TGCAGCAATT AAATATTTAG 
TAGAGAAGTA CGTGTAATTA 
AATTGGAAAG ATTAACAATA 
TAAGATAAAT AAATTAACAC 
GCAGCTTATA AATATGATTG 
TGACAACGCT CAGAAAAGGT 
TAGATCTTAT GCATTAAAAT 
TAATTTAGAA TCTTTTGCCT 
AGAGCTTATT AAACATGCAA 



ATGCATTATT AATGGGAGGA 
AAGAACAAAA AAATAACAAT 
TTAATGATTT ATATAATAAT 
GGAAATATGA GGATTTAATT 
ATAAGGCTAA TATACCAAAT 
AGAATTTAAT CCCTTCTACT 
AAGAAAATAT TCTTAAAAAC 
AAGATGAATA TGCTTTAATA 
AAAAAACATC ATTAATGGAG 
AATTGTTGCA AAATAATTTA 
ATATGGCAGA AAATGAAATA 
TAAAAGAAAG CATTATTAAA 
TGTCTAGACA GGCTTTAAGT 
CTAAAAGAAT TGAACCAATG 
AAACTGTTTT AGAAAGTCTC 
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tll-12 .nt 



TTGTAATCTAGATTCCAAATTATCTAGTAACAAAGAACAAAAAAATAACAATAATGTAAAAGAAGTTTCGGATAGT 

GTTCAAGAAGATGGTCTTAATGATTTATATAATAATCAAGAAAAGCAAAAAAGCTTTACTAAAAATTTT^ 

GGAAATATGAGGATTTAATTAATCCTATAGAGCCTATAATACCTTCAGAATCACCAAAGAATAAGGCTAATATACC 

AAATATTTCAATTGCGCATACTGAAAAAAAAGAGACAAAAAAGGAGAATTTAATCCCTTCTACTAATGAAGAAAAG 

GAAGCTGATGCAGCAATTAAATATTTAGAAGAAAATATTCTTAAAAACTCTAAATTTTCTGAATTAATTA^ 

TACGTGTAATTAAAGATGAATATGCTTTAATAAAAGCTGATTTGTATGATGTAATTGGAAAGATTAACAATAAA 

AACATCATTAATGGAGAATCCTAAGAACAATAGAGATAAGATAAATAAATTAACACAATTGTTGCAAAATAATTTA 

AAGATAGATAGTGAACTTGAGCAGCTTATAAATATGATTGATATGGCAGAAAATGAAATAAGCTCTGCGGCTTTCT 

TTTTTGACAACGCTCAGAAAAGGTTAAAAGAAAGCATTATTAAAAGATTAGAGAGTAAAAATAATAGATCTTATGC 

ATTAAAATTGTCTAGACAGGCTTTAAGTGACGCAAGAAGTGCTTTAAGTAATTTAGAATCTTTTGCCTCTAAAAGA 

ATTGAACCAATGGTGAGAAAGGAAGAAATAAAAGAGCTTATTAAACATGCAAAAACTGTTTTAGAAAGTCTCAATA 

AAAAA 



fll-12 .aa 



KENIFMRKSL FLYALLMGGL 
EKQKSFTKNF GERKYEDLIN 
EEKEADAAIK YLEENILKNS 
PKNNRDKINK LTQLLQNNLK 
LESKNNRSYA LKLSRQALSD 
KK 



MSCNLDSKLS SNKEQKNNNN 
PIEPIIPSES PKNKANIPNI 
KFSELIREVR VIKDEYALIK 
IDSELEQLIN MIDMAENEIS 
ARSALSNLES FASKRIEPMV 



VKEVSDSVQE DGLNDLYNNQ 

SIAHTEKKET KKENLIPSTN 

ADLYDVIGKI NNKKTSLMEN 

SAAFFFDNAQ KRLKESIIKR 

RKEEIKELIK HAKTVLESLN 



tll-12 .aa 

CNLDSKLSSNKEQKlSnraNVKEVSDSVQEDGLNDLYNNQEKQKSFTKNFGERKYEDLINPIEPIIPSESP^ 
NISIAHTEKKETKKENLIPSTNEEKEADAAIKYLEENILKNSKFSELIREVRVIKDEYALIKADLYDVIGKINNKK 
TSLMENPKNNRDKINKLTQLLQNNLKIDSELEQLINMIDMAENEISSAAFFFDNAQKRLKESIIKRLESKI^ 
LKLSRQALSDARSALSNLESFASKRIEPMVRKEEIKELIKHAKTVLESLNKK 



fll-4.nt 



TAAAGGAGTT TACAAATGAG 
TGTAAATGGT ATGTAGACAA 
ACATCTATTG ATCAAGTATT 
ATCACAAAAT TAACTCCGGA 
GAAAAATCCA AAAAAGAAAT 
GTAAAGGATA CTCCTCGCTT 
GTTTTTCAAA CACTAATTAA 
AAGAATGGAC TAAAGATGGT 
AATGGTGATA AAAGTACCCA 
GCTGAAAATT CGGTAAGCGT 
AAATGTATTC AAACTCTTAT 
CTTAAAAACA AAAATGATGG 



TAAACTAATA TTGGCAATAT 
TACCATTGAT GAAGCAACTG 
AGATGAGATA AGTGAAGCCA 
AGAGCTAGAA AATTTAGCAA 
TGAAGATCAA AAAAATACCA 
AATCAAATTG ATAAAGAATT 
TATAGGTTAT AATGCTACCT 
GAAATTACTG GATGAGTTGC 
AAAATACAAT GAACTTAAAA 
TTCTTTTAAA GAACATTCAA 
GAAAAATGTA GAAACATACT 
TGAGTACGAA AAAACATTGA 



CTATACTGCT AATAATTTCA 
TAGAAAGTAA ATCAGCACTA 
CAGGCCTAAG TTCGGAAAAA 
AGGAAGCTCA AGATGACTCT 
AGGAAAGTAA AAACATAGAA 
CATCAGAAAA AATTGATTCG 
ATGCAGCCAA AAGTAATTTG 
TAAAAATATC GGTAAGTAGC 
CCGTTGTAAA TAAGTTTAAT 
ACAGTAAAAT TGAAACTAAA 
TTGAAGGTGT ATGCAGCGAA 
CAACTTTAAG CTAA 



tll-4.nt 



ATGTAAATGGTATGTAGACAATACCATTGATGAAGCAACTGTAGAAAGTAAATCAGCACTAACATCTATTGATCAA 

GTATTAGATGAGATAAGTGAAGCCACAGGCCTAAGTTCGGAAAAAATCACAAAATTAACTCCGGAAGAGCTAGAAA 

ATTTAGCAAAGGAAGCTCAAGATGACTCTGAAAAATCCAAAAAAGAAATTGAAGATCAAAAAAATACC 

TAAAAACATAGAAGTAAAGGATACTCCTCGCTTAATCAAATTGATAAAGAATTCATCAGAAAAAATTGAT^ 

TTTCAAACACTAATTAATATAGGTTATAATGCTACCTATGCAGCCAAAAGTAATTTGAAGAATGGACTA 

TGAAATTACTGGATGAGTTGCTAAAAATATCGGTAAGTAGCAATGGTGATAAAAGTACCCAi^^ 

TAAAACCGTTGTAAATAAGTTTAATGCTGAAAATTCGGTAAGCGTTTCTTTTAAAGAACATTCAAACAGTAAAATT 
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GAAACTAAAAAATGTATTCAAACTCTTATGAAAAATGTAGAAACATACTTTGAAGGTGTATGCAGCGAACTTAAAA 
ACAAAAATGATGGTGAGTACGAAAAA 

fll-4.aa 

RSLQMSKLIL AISILLIISC KWYVDNTIDE ATVESKSALT SIDQVLDEIS EATGLSSEKI 
TKLTPEELEN LAKEAQDDSE KSKKEIEDQK NTKESKNIEV KDTPRLIKLI KNSSEKIDSV 
FQTLINIGYN ATYAAKSNLK NGLKMVKLLD ELLKISVSSN GDKSTQKYNE LKTWNKFNA 
ENSVSVSFKE HSNSKIETKK CIQTLMKWE TYFEGVCSEL KNKNDGEYEK TLTTLS 

tll-4.aa 

CKWYVDNTIDEATVESKSALTSIDQVLDEISEATGLSSEKITKLTPEELENLAKEAQDDSEKSKKEIEDQKNTKES 
KNIEVKDTPRLIKLIKNSSEKIDSVFQTLINIGYNATYAAKSNLKNGLKMVKLLDELLKISVSSNGDKSTQKYNEL 
KTWNKFNAENSVSVSFKEHSNSKIETKKCIQTLMKNVETYFEGVCSELKNKNDGEYEK 

f 112-1. nt 

TGAATCTCTA AAGATTTTAG CAGGGGAGAA AATATGAAAA AAAGTTTTTT ATCAATATAC 
ATGTTAATTT CAATAAGTTT ATTATCATGT GATGTTAGTA GATTAAATCA GAGAAATATT 
AATGAGCTTA AAATTTTTGT TGAAAAGGCC AAGTATTATT CTATAAAATT AGACGCTATT 
TATAACGAAT GTACAGGAGC ATATAATGAT ATTATGACTT ATTCGGAAGG TACATTTTCT 
GATCAAAGTA AGGTTAATCA AGCTATATCT ATATTTAAAA AAGACAATAA AATTGTTAAT 
AAGTTTAAGG AGCTTGAAAA GATTATAGAA GAATACAAAC CTATGTTTTT AAGTAAATTA 
ATTGATGATT TTGCGGGATC CGTT 

tll2-l .nt 

ATGTGATGTTAGTAGATTAAATCAGAGAAATATTAATGAGCTTAAAATTTTTGTTGAAAAGGCCAAGTATTATTCT 
ATAAAATTAGACGCTATTTATAACGAATGTACAGGAGCATATAATGATATTATGACTTATTCGGAAGGTACATTTT 
CTGATCAAAGTAAGGTTAATCAAGCTATATCTATATTTAAAAAAGACAATAAAATTGTTAATAAGTTTAAGGAGCT 
TGAAAAGATTATAGAAGAATACAAACCTATGTTTTTAAGTAAATTAATTGATGATTTT 

fll2-l.aa 

ISKDFSRGEN MKKSFLSIYM LISISLLSCD VSRLNQRNIN ELKIFVEKAK YYSIKLDAIY 
NECTGAYNDI MTYSEGTFSD QSKVNQAISI FKKDNKIVNK FKELEKIIEE YKPMFLSKLI 
DDFAGSV 

tll2-l.aa 

CDVSRI^QRNINELKIFVEKAKYYSIKLDAIYNECTGAYNDIMTYSEGTFSDQSKVNQAISIFKKDNKIVNKFKE^ 
EKIIEEYKPMFLSKLIDDF 

fl4-8.nt 

TAAATACAGA. GCCATTCAAG GAGAGTATTT ATGAAATACT ATATATGTGT GTGTGTTTTT 
TTGCTTTTGA ATGCTTGCAA TTCAGATTTT AGCACTAATC AAGAAGATAT TAAATATCCA 
TCTGATAAAG AGAAATCAAA ATCCAACATG GAAGCAAGCT CTAAAGAAGA AGATCCAAAT 
AAAAAAATAA AAAATACACT GCTTAATGAT TTAATAAATT TGATAGAAAT AGCTAATGAG 
CATAAAGAAA AATATGAAAA AAGAATGCAA GAAGAACCTT CAGATCAATA CGGAATATTG 
GCTTTCCAGG AATTAGACTT GTCCGTTGGA AAAATATCTG AAGACACCCC GCAATCTAAA 
AAATTTAGAA AAAACACCTA TTCTCCCTTA AGCGCTATTG ATGTCAATAA ATTAAAAGAT 
CTTTCAGAGA TTATAAGAAA TTCGGGCCAA ATACAAGGTT TATTTAATAT TTTCAACAGA 
TTCGGAGGCA TTTTTGACGA CTCACTTAAT CACGTATATT CTAAAAAAGA TATCCTAGGG 
GGACTAGAAA TTTTGGATTT AGATAAACTA AAAAATTCGT TTGAAAAATT ACTATCTATA 
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AAAGAAACTT TCTCAAAAAT GCTAAATCAA CTTTTATTAG ATTATAAAAA TGATAAAGAT 
CATATACGAA CAGAGACAAA TAAACTTAAA TCTCATACAA CTGCACTTTT CGAACAACTT 
GATAAAAAAG AAGACGAAGC ATATGAACCT AAAAATCAGA TATTTTCAAT AAGTAACCTT 
TAA 

tl4-8.nt 

TTGCAATTCAGATTTTAGCACTAATCAAGAAGATATTAAATATCCATCTGATAAAGAGAAATCAAAATCCAAC^ 

GAAGCAAGCTCTAAAGAAGAAGATCCAAATAAAAAAATAAAAAATACACTGCTTAATGATTTAATAAATTTGAT^ 

AAATAGCTAATGAGCATAAAGAAAAATATGAAAAAAGAATGCAAGAAGAACCTTCAGATCAATACGGAATATTGGC 

TTTCCAGGAATTAGACTTGTCCGTTGGAAAAATATCTGAAGACACCCCGCAATCTAAAAAATTTAGAAAAAACACC 

TATTCTCCCTTAAGCGCTATTGATGTCAATAAATTAAAAGATCTTTCAGAGATTATAAGAAATTCGGGCCAAATAC 

AAGGTTTATTTAATATTTTCAACAGATTCGGAGGCATTTTTGACGACTCACTTAATCACGTATATTCTAAAAAAGA 

TATCCTAGGGGGACTAGAAATTTTGGATTTAGATAAACTAAAAAATTCGTTTGAAAAATTACTATCTATAAAAGAA 

ACTTTCTCAAAAATGCTAAATCAACTTTTATTAGATTATAAAAATGATAAAGATCATATACGAACAGAGACAAATA 

AACTTAAATCTCATACAACTGCACTTTTCGAACAACTTGATAAAAAAGAAGACGAAGCATATGAACCTAAAAATCA 

G 

fl4-8.aa 

IQSHSRRVFM KYYICVCVFL LLNACNSDFS TNQEDIKYPS DKEKSKSNME ASSKEEDPNK 
KIKNTLLNDL INLIEIANEH KEKYEKRMQE EPSDQYGILA FQELDLSVGK ISEDTPQSKK 
FRKNTYSPLS AIDVNKLKDL SEIIRNSGQI QGLFNIFNRF ' GGIFDDSLNH VYSKKDILGG 
LEILDLDKLK NSFEKLLSIK ETFSKMLNQL LLDYKNDKDH IRTETNKLKS HTTALFEQLD 
KKEDEAYEPK NQIFSISNL 



tl4-8 .aa 

CNSDFSTNQEDIKYPSDKEKSKSNMEASSKEEDPNKKIKNTLLNDLINLIEIANEHKEKYEKRMQEEPSDQYGILA 
FQELDLSVGKISEDTPQSKKFRKNTYSPLSAIDVNKLKDLSEIIRNSGQIQGLFNIFNRFGGIFDDSLNHVYSKKD 
ILGGLEILDLDKLKNSFEKLLSIKETFSKMLNQLLLDYKNDKDHIRTETNKLKSHTTALFEQLDKKEDEAYEPKNQ 

fl7-6.nt 

TAAAGGAGGG TATTTATGAA 
GCTTGCAGGC CGGATTTTAA 
TCAAGGCCCA AAACTGAAAG 
CTTAAGAAAA AACAACAAGA 
AAAAAACAAC AAGAAGAAGA 
AGAAAACAAC AACTAAAAAA 
TACAATTTTA AAGAAAAATA 
ATGACGTCTT TTAGGGGATT 
GAAAGATCTA TAAGATATAG 
GAATTAAAGG AATTCGCAAA 
ATATTTAATT CTTTTAGCGC 
TTCAAAAAAG ACAATCTAGA 
TTTGAACAAA TATTATATAT 
GATTATAAAA ATCTAAAAAC 
AATGGAATTA AGCAACAAGC 
TATAAACTTT AA 

tl7-6.nt 

TTCCAGGCCGGATTTTAATATCGATCAAAAAGACATTAAATACCCGCCTACTGAAAAATCAAGGCCCAAAACTGAA 
AGCTCTAAGCAAAAAGAATCAAAGCCTAAAACAGAAGAAGAGCTTAAGAAAAAACAACAAGAAGAAGAGCTTAAGA 



ATACCACATA ATTACAACTA 
TATCGATCAA AAAGACATTA 
CTCTAAGCAA AAAGAATCAA 
AGAAGAGCTT AAGAAAAAAC 
GCTTAAGAAA AAACAACAAG 
TACGCTATCT AATGATTTAA 
TGTAAAAAGT ATGGAAAAAG 
GAATTGGGGG CCAGGGACTG 
AAGACACACT TATACTGTTT 
TATTATTCAA GATATAAATA 
TATTGGAGGA GCTCTTGACA 
CAAACTAGAT ATTGCAGATT 
AAAAGGAAGT GTTGCAGGAA 
AGATATTAAT AAGCTTAAAT 
TCTAGAAGCA GAAAATCTAG 



TATTTGTTTT TCTGTTTTTA 
AATACCCGCC TACTGAAAAA 
AGCCTAAAAC AGAAGAAGAG 
AAGAAGAAGA AGAGCTTAAG 
AAGAAGAGAA GGAAGAACTA 
AAAAGCAAAT AGAATCGGCC 
AACCTGAAGA CCATTACGGG 
AAGATATATC TGACAATACC 
TAAGCCCCCT GGATCCTCAT 
AACTAGCATC AGTAGCAAGT 
TAGTAAGTGA TCACCTATAT 
TAGAAATACT TAAAAATTCA 
AAGCAAAAAA ACTTTTATTA 
CTTATTCAAA TGAACTGGTT 
AAGAGCTTAT AGTGTCAAAA 
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AAAAACAACAAGAAGAAGAGCTTAAGAAAAAACAACAAGAAGAAGAGCTTAAGAAAAAACAACAAGAAGAAGAGAA 
GGAAGAACTAAGAAAACAACAACTAAAAAATACGCTATCTAATGATTTAAAAAAGCAAATAGAATCGGCCTACAAT 
TTTAAAGAAAAATATGTAAAAAGTATGGAAAAAGAACCTGAAGACCATTACGGGATGACGTCTTTTAGGGGATTGA 
ATTGGGGGCCAGGGACTGAAGATATATCTGACAATACCGAAAGATCTATAAGATATAGAAGACACACTTATACTGT 
TTTAAGCCCCCTGGATCCTCATGAATTAAAGGAATTCGCAAATATTATTCAAGATATAAATAAACTAGCATCAGTA 
GCAAGTATATTTAATTCTTTTAGCGCTATTGGAGGAGCTCTTGACATAGTAAGTGATCACCTATATTTCAAAAAAG 
ACAATCTAGACAAACTAGATATTGCAGATTTAGAAATACTTAAAAATTCATTTGAACAAATATTATATATAAAAGG 
AAGTGTTGCAGGAAAAGCAAAAAAACTTTTATTAGATTATAAAAATCTAAAAACAGATATTAATAAGCTTAAATCT 
TATTCAAATGAACTGGTTAATGGAATTAAGCAACAAGCTCTAGAAGCAGAAAATCTAGAAGAGCTTATAGTGTCAA 
AATATAAACTT 



fl7-6.aa 



RRVFMKYHII TTIFVFLFLA 
KKKQQEEELK KKQQEEELKK 
NFKEKYVKSM EKEPEDHYGM 
LKEFANIIQD INKLASVASI 
EQILYIKGSV AGKAKKLLLD 
KL 



CRPDFNIDQK DIKYPPTEKS 

KQQEEELKKK QQEEEKEELR 

TSFRGLNWGP GTEDISDNTE 

FNSFSAIGGA LDIVSDHLYF 

YKNLKTDINK LKSYSNELVN 



RPKTESSKQK ESKPKTEEEL 

KQQLKNTLSN DLKKQIESAY 

RSIRYRRHTY TVLSPLDPHE 

KKDNLDKLDI ADLEILKNSF 

GIKQQALEAE NLEELIVSKY 



tl7-6 .aa 



CRPDFNIDQKDIKYPPTEKSRPKTESSKQKESKPKTEEELKKKQQEEELKKKQQEEELKKKQQEEELKKKQQEEEK 
EELRKQQLKNTLSNDLKKQIESAYNFKEKYVKSMEKEPEDHYGMTSFRGLNWGPGTEDISDNTERSIRYRRHTYTV 
LSPLDPHELKEFANIIQDINKLASVASIFNSFSAIGGALDIVSDHLYFKKDNLDKLDIADLEILKNSFEQILYIKG 
SVAGKAKKLLLDYKNLKTDINKIiKSYSNELVNGIKQQALEAENLEELIVSKYKL 



fl9-2.nt 

TAAAGAAAGA 
ATATTTGTTT 
AAACCTAAAA 
AAAGAAGAAA 
AATGATTTAA 
TTAAAAGAAG 
GATGGAACAA 
GCTTATAGCA 
ATGGCATCAG 
GAAAAGATAG 
GTTTTAAAAA 
GAAATGATGC 
AAAAATGAAC 
GAAGCACTAA 



TTAAATCATA 
TTCTATTTTT 
CAGAAACAAG 
AAGAAGCAAA 
GAAATTTAAT 
AATCCTCAAG 
ATGAACAATT 
TCTTAAATAC 
GACAAACACA 
TTAATTGTTT 
AGCTTAAAGA 
ACAAGCTCTT 
TTAAGTCTTA 
AGCTAAAAAA 



TTCAAGGAGA 
AAATGCTTGT 
CTTAAATCAA 
AGAAGAAGGC 
AGAAACAGCT 
CCAATACGGA 
GTCCGCAAAT 
TATTAATGAC 
GGGCATATTT 
GTATCCCAAA 
TTCTTTGGAA 
ATTAGACTAT 
TGCAGACACA 
TACCATATGC 



GTATTTATGA 
TATCCAGTTG 
GAAGAAGTCC 
ATTAATAAAA 
AAAAAAGATA 
ATACTGGCTT 
ACCGAAAGAT 
GCTTCCTTAA 
AATACCCTTA 
AAAGACAATT 
AATTTTTTAG 
CAAAATAATA 
CTTTTCAATC 
TCAATAGAGG 



AACACTATAT 
CATCTAATAA 
CAAATCAAGA 
AAACAGAAAA 
ATGATAAATA 
TCAAAGATTT 
CTAAAGCCTA 
AGAATTTTTC 
ACTCACTTGG 
TGGAAAAATT 
AGATAAAAAA 
CAAATCGTAT 
AAATGACAAA 
ACCTTTAA 



AATTGTGCAT 
AATAGAATTA 
AGCAAACTAC 
CACGCTGCTT 
TACACAAAAG 
GTTCTGGCTA 
TAGAAAACGA 
AGAAATTGTA 
GGGTAATTTT 
AGAGACTTCA 
T^TCGCCTCA 
ACAAACAGAT 
AAAACCCGAA 



tl9-2.nt 



TTGTTATCCAGTTGCATCTAATAAAATAGAATTAAAACCTAAAACAGAAACAAGCTTAAATCAAGAAGAAGTCCCA 

AATCAAGAAGCAAACTACAAAGAAGAAAAAGAAGCAAAAGAAGAAGGCATTAATAAAAAAACAGAAAACACGCTGC 

TTAATGATTTAAGAAATTTAATAGAAACAGCTAAAAAAGATAATGATAAATATACACAAAAGTTAAAAGAAGAATC 

CTCAAGCCAATACGGAATACTGGCTTTCAAAGATTTGTTCTGGCTAGATGGAACAAATGAACAATTGTCCGCAAAT 

ACCGAAAGATCTAAAGCCTATAGAAAACGAGCTTATAGCATCTTAAATACTATTAATGACGCTTCCTTAAAGAATT 

TTTCAGAAATTGTAATGGCATCAGGACAAACACAGGGCATATTTAATACCCTTAACTCACTTGGGGGTAAT 

AAAGATAGTTAATTGTTTGTATCCCAAAAAAGACAATTTGGAAAAATTAGAGACTTCAGTTTTAAAAAAGCTO 

GATTCTTTGGAAAATTTTTTAGAGATAAAAAAAATCGCCTCAGAAATGATGCACAAGCTCTTATTAGACTATC 

ATAATACAAATCGTATACAAACAGATAAAAATGAACTTAAGTCTTATGCAGACACACTTTTCAATCAAATC 

AAAACCCGAAGAAGCACTAAAG 
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fl9-2 .aa 

RKIKSYSRRV FMKHYIIVHI FVFLFLNACY PVASNKIELK PKTETSLNQE EVPNQEANYK 

EEKEAKEEGI NKKTENTLLN DLRNLIETAK KDNDKYTQKL KEESSSQYGI LAFKDLFWLD 

GTNEQLSANT ERSKAYRKRA YSILNTINDA SLKNFSEIVM ASGQTQGIFN TLNSLGGNFE 

KIVNCLYPKK DNLEKLETSV LKKLKDSLEN FLEIKKIASE MMHKLLLDYQ NNTNRIQTDK 
NELKSYADTL FNQMTKKPEE ALKLKNTICS lEDL 

tl9-2 .aa 

CYPVASNKIELKPKTETSLNQEEVPNQEANYKEEKEAKEEGINKKTENTLLNDLRNLIETAKKDNDKYTQKLKEES 
SSQYGILAFKDLFWLDGTNEQLSANTERSKAYRKRAYSILNTINDASLKNFSEIVMASGQTQGIFNTLNSLGGNFE 
KIVNCLYPKKDNLEKLETSVLKKLKDSLENFLEIKKIASEMMHKLLLDYQNNTNRIQTDKNELKSYADTLFNQMTK 
KPEEALK 

f 19-4 .nt 

TAATCTATAC TAATTGAGGA GAATATTTTT ATGAAAAACA ACATAATTTT ATGCATGTGT 

GTTTTTTTAC TTTTAAATAG CTGCACCGCT AACCATGAAG CTGAAGCGAA AATAAAAAAA 

CATGTTGATA AAACAAAAAA CGAATATATT AATGAAATAA AAAATTTAAT AGCAACAACC 

AAAGAAATCA TCGAAAAACG AAAATTGCTA CAAGCTAAAC CAGTAGATCA AAACCCCGTA 

GATGATACAA ACAATAAGAA AGTTTTCGAG ATAGATAAAA GAGCTTTCGA TTTTATAAAT 

AGTTTTTTAA CAGATGATGA ATTTAATAAA TTTGTAACAA TATTTCATAA ACCAACACTA 

AAATCACCCG GAAAAGTATT AAATAGCATA GCAATTCTAG AGCTAAACAT AGAGCAGGTA 

ATTAATCACC TAGACTCAAA AAATGAGACC TTAAATAAAG CAAGCTCTTT AGATTTGGAA 

AAGATCAAAA ATTCCCTTGA ACAGCTGTTC TCTATAAGGA ATTTTTTTTC AACAATCATA 

AAAAGGGTCT TATTAGATCA TCAAAACAAT GAAAATTCTA TAAAACCAGA TGATTCTAT^ 

TCAGGAACCT ATTTCGATAC GATATACGAT CAGTTTAATG AAAAAAATAA AGAGGTTAGA 

AATCTGAAAA AAACCATATT ATCACTGCCG AATTAA 

tl9-4,nt 

CTGCACCGCTAACCATGAAGCTGAAGCGAAAATAAAAAAACATGTTGATAAAACAAAAAACGAATATATTAATGAA 
ATAAAAAATTTAATAGCAACAACCAAAGAAATCATCGAAAAACGAAAATTGCTACAAGCTAAACCAGTAGATCAAA 
ACCCCGTAGATGATACAAACAATAAGAAAGTTTTCGAGATAGATAAAAGAGCTTTCGATTTTATAAATAGTTTTTT 
AACAGATGATGAATTTAATAAATTTGTAACAATATTTCATAAACCAACACTAAAATCACCCGGAAAAGTATTAAAT 
AGCATAGCAATTCTAGAGCTAAACATAGAGCAGGTAATTAATCACCTAGACTCAAAAAATGAGACCTTAAATAAAG 
CAAGCTCTTTAGATTTGGAAAAGATCAAAAATTCCCTTGAACAGCTGTTCTCTATAAGGAATTTTTTTTCAACAAT 
CATAAAAAGGGTCTTATTAGATCATCAAAACAATGAAAATTCTATAAAACCAGATGATTCTAAATCAGGAACCTAT 
TTCGATACGATATACGATCAGTTTAATGAAAAAAATAAAGAGGTTAGAAATCTGAAAAAA 

fl9-4.aa 

SILIEENIFM KNNIILCMCV FLLLNSCTAN HEAEAKIKKH VDKTKNEYIN EIICNLIATTK 
EIIEKRKLLQ AKPVDQNPVD DTNNKKVFEI DKRAFDFINS FLTDDEFNKF VTIFHKPTLK 
SPGKVLNSIA ILELNIEQVI NHLDSKNETL NKASSLDLEK IKNSLEQLFS IRNFFSTIIK 
RVLLDHQNNE NSIKPDDSKS GTYFDTIYDQ FNEKNKEVRN LKKTILSLPN 

tl9-4 .aa 

CTANHEAEAKIKKHVDKTKNEYINEIKNLIATTKJ:iIEKRKLLQAKPVDQNPVDrXrN^ 
TDDEFNKFVTIFHKPTLKSPGKVLNSIAILEIJSriEQVINHLDSKNETLNKASSLDLEKIKNSLEQLFS^ 
I KRVLLDHQNNEN S I K PDDSKSGTYFDT I YDQFNEKNKEVRNLKK 



fl9-6.nt 
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TAAAGGAGAG TATTAATGAA 
GCTTGCAGTA CAGATTTTAA 
TCAAAGCCCA AAACTGAAGA 
CTAAAGAAAA AACAACAACT 
ACAGCTAATA AGCATAAAGA 
TACGGGGTAC AGGCTTTCAA 
AACACCGAAA GATCTATAAG 
CTTCATGAAT TAAAGGAATT 
GTAGATATGT TTAATTTCTT 
TTATATCCCA AAAAGACAAT 



ATGCCATATA ATTGCAACTA 
TACTGATCAA AAAGGCATTA 
CTCTAAGCAA AAAGAATTAA 
AAAAAATAAA CTACTTAATG 
AAAGTATAAA AAAAGAATGA 
AGGATCGAAT TGGGGGCCGG 
ATTTAGAAGA CATACTTATA 
CTCAAATATT GTTACAAATG 
TAGCTCTATT GGGACAGCTC 
CTGGACAAAC CAGATCTGTC 



TATTTGTTTT TCTATTTTTA 
AATACCCGCC TACCGAAAAA 
AGCCTAAAAC AGAAAAAGAA 
ATTTAAAAAA TTCAATAGAA 
AAGAAGAACC CGAAGATCAA 
GGACTGAAGA TGTATCTGCC 
CTATTTTAAG CACGCTGAGT 
AAAATAAACT GGTGCCAGTA 
TTGATATAAC AACCGATAGC 
GGATTTAG 



tl9-6 .nt 



TTGCAGTACAGATTTTAATACTGATCAAAAAGGCATTAAATACCCGCCTACCGAAAT^TCAAAGCCCAAAACTGAA 
GACTCTAAGCAAAAAGAATTAAAGCCTAAAACAGAAAAAGAACTAAAGAAAAAACAACAACTAAAAAATAAACTAC 
TTAATGATTTAAAAAATTCAATAGAAACAGCTAATAAGCATAAAGAAAAGTATAAAAAAAGAATGAAAGAAGAACC 
CGAAGATCAATACGGGGTACAGGCTTTCAAAGGATCGAATTGGGGGCCGGGGACTGAAGATGTATCTGCCAACACC 
GAAAGATCTATAAGATTTAGAAGACATACTTATACTATTTTAAGCACGCTGAGTCTTCATGAATTAAAGGAATTCT 
CAAATATTGTTACAAATGAAAATAAACTGGTGCCAGTAGTAGATATGTTTAATTTCTTTAGCTCTATTGGGACAGC 
TCTTGATATAACAACCGATAGCTTATATCCCAAAAAGACAATCTGGACAAACCAGATCTGTCGG 



fl9-6.aa 



RRVLMKCHII ATIFVFLFLA 
KKKQQLKNKL LNDLKNSIET 
TERSIRFRRH TYTILSTLSL 
YPKKTIWTNQ ICRI 



CSTDFNTDQK GIKYPPTEKS 
ANKHKEKYKK RMKEEPEDQY 
HELKEFSNIV TNENKLVPW 



KPKTEDSKQK ELKPKTEKEL 
GVQAFKGSNW GPGTEDVSAN 
DMFNFFSSIG TALDITTDSL 



tl9-6 .aa 



CSTDFNTDQKGIKYPPTEKSKPKTEDSKQKELKPKTEKELKKKQQLKNKLLNDLKNSIETANKHKEKYKKRMKEEP 
EDQYGVQAFKGSNWGPGTEDVSANTERSIRFRRHTYTILSTLSLHELKEFSNIVTNENKLVPWDMFNFFSSIGTA 
LDITTDSLYPKKTIWTNQICR 



f21-4.nt 



TAGGAGACAA TCTTTATGAA 
CTGATAAGTT CTTGTAAGAA 
TTAGAAAGTT CAGAACAAAA 
GGATTTTTAG AAATTTTAGA 
AAACAAATTC AAGAATTAAA 
GAAACATATT CTGGGTATGA 
GGACTTGAAG ATAAATTAAA 
AAAAAAGCTT TACAAGAGGC 
GCAACTGGAG TAACGCATGG 
TGGCAGTGTG CTAATAGTTT 
ATGACCAATG AAGTTATAAC 
GGAGAAACTG TAGAAGGTAA 

t21-4.nt 



TAAAAAAATA AAAATGTTTA 
TGATGTAACT AGTAAAGATT 
TGTAAAAAAA ACAGAACAAG 
GACAAAAGAT TTAAACACAT 
GAATAAGATA GAAAAATTAG 
AGAAAAAATA AACAAAATAA 
TGAACTTTCA GAGAGCTTAA 
TAAAAAGAAA TTTGAAGAGT 
TTCTCAAGTC CAAAGACAAG 
GGGGTTTAAA AATATGACTA 
TAATTCGCTT AAAAAGATTG 
AAAAGAATAA 



TTATTTGTGC TATTTTTATG 
TAGAAGGGGC GGTGAAAGAT 
AGATAAAAAA ACAAGTTGAA 
TAGATACAAA AGAAATTGAA 
ACTCTAAAAA AACTTCTATT 
AAGAAAAATT AAACGGAAAA 
AAAAGAAAAA AGAGGAGAGA 
ATAAAAACCA AGCTGAATCT 
GTGGTGTTGG ATTACAAGCT 
GTGGTAATAA TACTAGCGAT 
AAGAAGAACT TAAAAATATT 



TTGTTUVGAATGATGTAACTAGTAAAGATTTAGAAGGGGCGGTGAAAGATTTAGAAAGTTCAGAACAAAATGTAAAA 
A7UUVCAGAACAAGAGATAAAAAAACAAGTTGAAGGATTTTTAGAAATTTTAGAGACAAAAGATTTAAACACATTAG 
ATACAAAAGAAATTGAAAAACAAATTCAAGAATTAAAGAATAAGATAGAAAAATTAGACTCTAAAAAAACTTCTAT 
TGAAACATATTCTGGGTATGAAGAAAAAATAAACA7VAATAAAAGAAAAATTAAACGGAAAAGGACTTG 
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TTAAATGAACTTTCAGAGAGCTTAAAAAAGAAAAAAGAGGAGAGAAAAAAAGCTTTACAAGAGGCTAAAAAGAAAT 
TTGAAGAGTATAAAAACCAAGCTGAATCTGCAACTGGAGTAACGCATGGTTCTCAAGTCCAAAGACAAGGTGGTGT 
TGGATTACAAGCTTGGCAGTGTGCTAATAGTTTGGGGTTTAAAAATATGACTAGTGGTAATAATACTAGCGATATG 
ACCAATGAAGTTATAACTAATTCGCTTAAAAAGATTGAAGAAGAACTTAAAAATATTGGAGAAACTGTAGAAGGTA 
AAAAAGAA 

£21-4. aa 

ETIFMNKKIK MFIICAIFML ISSCKNDVTS KDLEGAVKDL ESSEQNVKKT EQEIKKQVEG 
FLEILETKDL NTLDTKEIEK QIQELKNKIE KLDSKKTSIE TYSGYEEKIN KIKEKLNGKG 
LEDKLNELSE SLKKKKEERK KALQEAKKKF EEYKNQAESA TGVTHGSQVQ RQGGVGLQAW 
QCANSLGFKN MTSGNNTSDM TNEVITNSLK KIEEELKNIG ETVEGKKE 

t21-4.aa 

CKNDVTSKDLEGAVKDLESSEQNVKKTEQEIKKQVEGFLEILETKDLNTLDTKEIEKQIQELKNKIEKLDSKKTSI 
ETYSGYEEKINKIKEKLNGKGLEDKLNELSESLKKKKEERKKALQEAKKKFEEYKNQAESATGVTHGSQVQRQGGV 
GLQAWQCAMSLGFKNMTSGNNTSDMTNEVITNSLKKIEEELKNIGETVEGKKE 

£24-1. nt 

TAAGCTGGTA ACACTGTAAA GACAGCTGAG GGGGCTTCAA GTGGTACTGA TGCAATTGGA 
GAAGTTGTGG ATAATGATGC TAAGGTTGCT GATAAGGCGA GTGTGACGGG GATTGCTAAG 
GGGATAAAGG AGATTGTTGA AGCTGCTAGG GGGAGTGAAA AGCTGAAAGT TGCTGCTGCT 
AAAGAGGGCA ATGAAAAGGC AGGGAAGTTG TTTGGGAAGG CTGGTGCTAA TGCTCATGGG 
GACAGTGAGG CTGCTAGCAA GGCGGCTGGT GCTGTTAGTG CTGTTAGTGG GGAGCAGATA 
TTAAGTGCGA TTGTTAAGGC TGCGGATGCG GCTGAGCAGG ATGGAAAGAA GCCTGCAGAT 
GCTACAAATC CGATTGCTGC TGCTATTGGG AATAAAGATG AGGATGCGGA TTTTGGTGAT 
GGGATGAAGA AGGATGATCA GATTGCTGCT GCTATTGCTT TGAGGGGGAT GGCTAAGGAT 
GGAAAGTTTG CTGTGAAGAA TGATGAGAAA GGGAAGGCTG AGGGGGCTAT TAAGGGAGCT 
GCTGCAATTG GAGAAGTTGT GGATAATGCT GGTGCTGCGA AGGCTGCTGA TAAGGATAGT 
GTGAAGGGGA TTGCTAAGGG GATAAAGGAG ATTGTTGAAG CTGCTGGGGG GAGTGAAAAG 
CTGAAAGCTG CTGCTGCTGA AGGGGAGAAT AATAAAAAGG CAGGGAAGTT GTTTGGGAAA 
GTTGATGGTG CTGCTGGGGA CAGTGAGGCT GCTAGCAAGG CGGCTGGTGC TGTTAGTGCT 
GTTAGTGGGG AGCAGATATT AAGTGCGATT GTTAAGGCTG CTGGTGAGGC TGAGCAGGAT 
GGAGAGAAGC CTGAGGATGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTAATGGG 
GATGGTGCGG AGTTTGATCA GGATGAGATG AAGAAGGATG ATCAGATTGC TGCTGCTATT 
GCTTTGAGGG GGATGGCTAA GGATGGAAAG TTTGCTGTGA AGGGTAATAA TGAGAAAGAG 
AAGGCTGAGG GGGCTATTAA AGAAGTTAGC GAGTTGTTGG ATAAGCTGGT AACAGCTGTA 
AAGACAGCTG AGGGGGCTTC AAGTGGTACT GATGCAATTG GAGAAGTTGT GGATAATGNT 
GCNAAGGNTG CTGATAAGGC GAGTGTGACG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTN GGGGGAGTGA AAAGCTGAAA GTTGCTGCTG CTANAGNGGN NAATAATAAA 
GAGGCAGGGA AGTTGTTTGG GAAGGCTGGT GCTGATGCTA ATGGGGACAG TGAGGCTGCT 
AGCAAGGCGG CTGGTGCTGT TAGTGCTGTT AGTGGGGAGC AGATATTAAG TGCGATTGTT 
AAGGCTGCGG CTGCTGGTGC GGCTGATCAG GATGGAGAGA AGCCTGGGGA TGCTAAAAAT 
CCGATTGCTG CTGCTATTGG GAAGGGTAAT GCGGATGATG GTGCGGATTT TGGTGATGGG 
ATGAAGAAGG ATGATCAGAT TGCTGCTGCT ATTGCTTTGA GGGGGATGGC TAAGGATGGA 
AAGTTTGCTG TGAAGAAGGA TGAGAAAGGG AAGGCTGAGG GGGCTATTAA GGGAGCTAGC 
GAGTTGTTGG ATAAGCTGGT AAAAGCTGTA AAGACAGCTG AGGGGGCTTC AAGTGGTACT 
GCTGCAATTG GAGAAGTTGT GGATAATGCT GCGAAGGCTG CTGATAAGGA TAGTGTGACG 
GGGATTGCTA AGGGGATAAA GGAGATTGTT GAAGCTGCAG GGGGGAGTGA AAAGCTGAAA 
GTTGCTGCTG CTAAAGGGGA GAATAATT^AA GGGGCAGGGA AGTTGTTTGG GAAGGCTGGT 
GCTAATGCTC ATGGGGACAG TGAGGCTGCT AGCAAGGCGG CTGGTGCTGT TAGTGCTGTT 
AGTGGGGAAC AGATATTAAG TGCGATTGTT AAGGCTGCTG GTGAGGCTGC TGGTGATCAG 
GAGGGAAAGA AGCCTGAGGA GGCTAAAAAT CCGATTGCTG CTGCTATTGG GGATAAAGAT 
GGGGATGCGG AGTTTAATCA GGATGGGATG AAGAAGGATG ATCAGATTGC TGCTGCTATT 
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GCTTTGAGGG GGATGGCTAA GGATGGAAAG TTTGCTGTGA AGGATGGTGG TGAGAAAGAG 
AAGGCTGAGG GGGCTATTAA AGGAGTTAGC GAGTTGTTGG ATAAGCTGGT AAAAGCTGTA 
AAGACAGCTG AGGGGGCTTC AAGTGGTACT GCTGCAATTG GAGAAGTTGT GGCTGATGCT 
GCTAAGGTTG CTGATAAGGC GAGTGTGACG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTG GGGACAGTGA GGCTGCTAGC AAGGCAGCTG GTGCTGTTAG TGCTGTTAGT 
GGGGAGCAGA TATTAAGTGC GATTGTTAAG GCTGCGGCTG CTGGTGCGGC TGAGCAGGAT 
GGAGAGAAGC CTGCAGAGGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTGATGGG 
GATGCGGATT TTGGTGAGGA TGGGATGAAG AAGGATGATC AGATTGCTGC TGCTATTGCT 
TTGAGGGGGA TGGCTAAGGA TGGAAAGTTT GCTGTGAAGA ATGATGAGAA AGGGAAGGCT 
GAGGGGGCTA TTAAGGGAGC TGCTGCAATT GGAGAAGTTG TGGATAATGC TGGTGCTGCG 
AAGGCTGCTG ATAAGGATAG TGTGAAGGGG ATTGCTAAGG GGATAAAGGA GATTGTTGAA 
GCTGCTGGGG GGAGTGAAAA GCTGAAAGCT GCTGCTGCTG AAGGGGAGAA TAATAAAAAG 
GCAGGGAAGT TGTTTGGGAA AGTTGATGGT GCTGCTGGGG ACAGTGAGGC TGCTAGCAAG 
GCGGCTGGTG CTGTTAGTGC TGTTAGTGGG GAGCAGATAT TAAGTGCGAT TGTTAAGGCT 
GCGGATGCGG CTGAGCAGGA TGGAAAGAAG CCTGCAGATG CTACAAATCC GATTGCTGCT 
GCTATTGGGA ATAAAGATGA GGATGCGGAT TTTGGTGATG GGATGAAGAA GGATGATCAG 
ATTGCTGCTG CTATTGCTTT GAGGGGGATG GCTAAGGATG GAAAGTTTGC TGTGAAGGGT 
AATAATGAGA AAGGGAAGGC TGAGGGGGCT TCAAGTGGTA CTGATGCAAT TGGAGAAGTT 
GTGGATAATG ATGCGAAGGC TGCTGATAAG GCGAGTGTGA CGGGGATTGC TAAGGGGATA 
AAGGAGATTG TTGAAGCTGC TGGGGGGAGT GAAAAGCTGA AAGCTGTTGC TGCTGCTACA 
AGGGAGAATA ATAAAGAGGC AGGGAAGTTG TTTGGGAAAG TTGATGATGC TCATGCTGGG 
GACAGTGAGG CTGCTAGCAA GGCGGCTGGT GCTGTTAGTG CTGTTAGTGC GGAGCAGATA 
TTAAGTGCGA TTGTTACGGC TGCGGCTGCT GGTGAGCAGG ATGGAGAGAA GCCTGCAGAG 
GCTACAAATC CGATTGCTGC TGCTATTGGG AAGGGTAATG AGGATGGTGC GGATTTTGGT 
AAGGATGAGA TGAAGAAGGA TGATCAGATT GCTGCTGCTA TTGCTTTGAG GGGGATGGCT 
AAGGATGGAA AGTTTGCTGT GAAGAGTAAT GATGGTGAGA AAGGGAAGGC TGAGGGGGCT 
ATTAAGGAAG TTAGCGAGTT GTTGGATAAG CTGGTAAAAG CTGTAAAGAC AGCTGAGGGG 
GCTTCAAGCG GTACTGATGC AATTGGAGAA GTTGTGGCTA ATGCTGGTGC TGCGAAGGCT 
GCTGATAAGG CGAGTGTGAC GGGGATTGCT AAGGGGATAA AGGAGATTGT TGAAGCTGCT 
GGGGGGAGTA AAAAGCTGAA AGCTGCTGCT GCTGAAGGGG AGAATAATAA AAAGGCAGGG 
AAGTTGTTTG GGAAGGCTGG TGCTGGTGCT GGTGCTAATG GGGACAGTGA GGCTGCTAGC 
AAGGCGGCTG GTGCTGTTAG TGCTGGTTAG 

t24-l .nt 

TGGTGAGGCTGAGCAGGATGGAGAGAAGCCTGAGGATGCTAAAAATCCGATTGCTGCTGCTATTGGGAAGGGTAAT 

GGGGATGGTGCGGAGTTTGATCAGGATGAGATGAAGAAGGATGATCAGATTGCTGCTGCTATTGCTTTGAGGGGGA 

TGGCTAAGGATGGAAAGTTTGCTGTGAAGGGTAATAATGAGAAAGAGAAGGCTGAGGGGGCTATTAAAGAAGTTAG 

CGAGTTGTTGGATAAGCTGGTAACAGCTGTAAAGACAGCTGAGGGGGCTTCAAGTGGTACTGATGCAATTGGAGAA 

GTTGTGGATAATGNTGCNAAGGNTGCTGATAAGGCGAGTGTGACGGGGATTGCTAAGGGGATAAAGGAGATTGTTG 

AAGCTGCTNGGGGGAGTGAAAAGCTGAAAGTTGCTGCTGCTANAGNGGNNAATAATAAAGAGGCAGGGAAGTTC 

TGGGAAGGCTGGTGCTGATGCTAATGGGGACAGTGAGGCTGCTAGCAAG 

£24-1. aa 

AGNTVKTAEG ASSGTDAIGE WDNDAKVAD KASVTGIAKG IKEIVEAARG SEKLKVAAAK 
EGNEKAGKLF GKAGANAHGD SEAASKAAGA VSAVSGEQIL SAIVKAADAA EQDGKKPADA 
TNPIAAAIGN KDEDADFGDG MKKDDQIAAA lALRGMAKDG KFAVKNDEKG KAEGAIKGAA 
AIGEWDNAG AAKAADKDSV KGIAKGIKEI VEAAGGSEKL KAAAAEGENN KKAGKLFGKV 
DGAAGDSEAA SKAAGAVSAV SGEQILSAIV KAAGEAEQDG EKPEDAKNPI AAAIGKGNGD 
GAEFDQDEMK KDDQIAAAIA LRGMAKDGKF AVKGNNEKEK AEGAIKEVSE LLDKLVTAVK 
TAEGASSGTD AIGEWDNXA KXADKASVTG lAKGIKEIVE AAXGSEKLKV AAAXXXNNKE 
AGKLFGKAGA DANGDSEAAS KAAGAVSAVS GEQILSAIVK AAAAGAADQD GEKPGDAKNP 
lAAAIGKGNA DDGADFGDGM KKDDQIAAAI ALRGMAKDGK FAVKKDEKGK AEGAIKGASE 
LliDKLVKAVK TAEGASSGTA AIGEWDNAA KAADKDSVTG lAKGIKEIVE AAGGSEKLKV 
AAAKGENNKG AGKLFGKAGA NAHGDSEAAS KAAGAVSAVS GEQILSAIVK AAGEAAGDQE 
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GKKPEEAKNP lAAAIGDKDG DAEFNQDGMK KDDQIAAAIA 
AEGAIKGVSE LLDKLVKAVK TAEGASSGTA AIGEWADAA 
AAGDSEAASK AAGAVSAVSG EQILSAIVKA AAAGAAEQDG 
ADFGEDGMKK DDQIAAAIAL RGMAKDGKFA VKNDEKGKAE 
AADKDSVKGI AKGIKEIVEA AGGSEKLKAA AAEGENNKKA 
AGAVSAVSGE QILSAIVKAA DAAEQDGKKP ADATNPIAAA 
AAAIALRGMA KDGKFAVKGN NEKGKAEGAS SGTDAIGEW 
EIVEAAGGSE KLKAVAAATR ENNKEAGKLF GKVDDAHAGD 
SAIVTAAAAG EQDGEKPAEA TNPIAAAIGK GNEDGADFGK 
DGKFAVKSND GEKGKAEGAI KEVSELLDKL VKAVKTAEGA 
DKASVTGIAK GIKEIVEAAG GSKKLKAAAA EGENNKKAGK 
AAGAVSAG 

t24-l .aa 

GEAEQIXSEKPEDAKNPIAAAIGKGNGDGAEFDQDEMKKDDQIAAAIALRGMAKDGKFAVKGNNEKEKAEGAIKEVS 
ELLDKLWAVKTAEGASSGTDAIGEVVDNXAKXADKASVTGIAKGIKEIVEAAXGSEKLKVAAAXXXNNKEAGKLF 
GKAGADANGDSEAASK 

f28-2 .nt 

TAAAAAGGAA ATATAAATAT 
TTAGTATTTA GTTCTCTTTT 
ATAGAACAGT TTGCATTAGC 
TCAGTAGATA AAAATAGTAA 
AAAACTTATG ATCCAATCTT 
GCTAATAATA AAGAATCCCT 
GCTCAAAATA ATGTAAAGAT 
GAACAAAGTA ATTTTAAAAA 
GAAGAAGAAA TTAAAGCTAA 
CTTTCAGAAA TTAAAAATGC 
AACAATACAC TCCTTGAGTT 
AATTTAGACG CATCTCCTTT 
ATGCGTTCTT TTATGGAGCA 
GAGGCTAAGG ATAAACTAGC 
CGGTTCGGTG GCAGTTTTAA 
GCTATAGACT TTGCTTCTGC 
CAGGGAAATT CTTGCAAAAA 

t28-2 .nt 

AAAAGATCATCAAGAAAATAAAAATACTACTAATACTTCAGTAGATAAAAATAGTAAGGAAATTGAATCTCCTAAA 

GACGTTACATCATCAAATAAAAAAACTTATGATCCAATCTTACAAGTAGGTTCTAATCAACATATGTCAGATGATC 

CTGGTGCTAATAATAAAGAATCCCTACCAAATTCAAGTCCAGCAATAATACAAAATGACTCGCATGCTCAAAATAA 

TGTAAAGATGGAAGAAAATAAATCAGCTACTCCACAACATGATCCAATTGAACAAAGTAATTTTAAAAATAGCCTT 

ACTACAACAAGTAAAACTCCTGCTATTCCTTCAGAAGAAGAAATTAAAGCTAACTTAGATGAATTTGCACAAGAAG 

AGTATGAGCAAACATCTCTTTCAGAAATTAAAAATGCCACGCAAATTGTTAATCATGCTAATCCTGAAAACAAATT 

AAACAATACACTCCTTGAGTTTGAA/U^GATTATGAAACTTTATCAAACTTGTTATTCTCTAATTTAGACGCATCT 

CCTTTGAATAGAAAAATAAAGACTATTATGCCTAAATTACAAGAAATGCGTTCTTTTATGGAGCAAGCi^ 

CTTGGGTATCTGCTAAAGGCATGCTAGATGAGGCTAAGGATAAACTAGCAGAATCTATTTATAAAAGACTATACAA 

TGGCAATTCATACCGGTTCGGTGGCAGTTTTAACGGACGTGATATGCAACATGCAAAAAATTTAGCATACAGAG 

ATAGACTTTGCTTCTGCATGCATTGAATATACACAAAAAGCTATTGATTATCTTCAACAGGGAAATTCTTGCAA^ 

AAGAAATAGAAAATATATTCAAG 



LRGMAKDGKF 
KVADKASVTG 
EKPAEAKNPI 
GAIKGAAAIG 
GKLFGKVDGA 
IGNKDEDADF 
DNDAKAADKA 
SEAASKAAGA 
DEMKKDDQIA 
SSGTDAIGEV 
IiFGKAGAGAG 



AVKDGGEKEK 
lAKGIKEIVE 
AAAIGKGDGD 
EWDNAGAAK 
AGDSEAASKA 
GDGMKKDDQI 
SVTGIAKGIK 
VSAVSGEQIL 
AAIALRGMAK 
VANAGAAKAA 
ANGDSEAASK 



TATGCGATTA TGTTTAATAA 

TTTATTTGAA AGTTGTTCTG 

ATTAAAAGAT CATCAAGAAA 

GGAAATTGAA TCTCCTAAAG 

ACAAGTAGGT TCTAATCAAC 

ACCAAATTCA AGTCCAGCAA 

GGAAGAAAAT AAATCAGCTA 

TAGCCTTACT ACAACAAGTA 

CTTAGATGAA TTTGCACAAG 

CACGCAAATT GTTAATCATG 

TGAAAAAGAT TATGAAACTT 

GAATAGAAAA ATAAAGACTA 

AGCAACTAAT TCTTGGGTAT 

AGAATCTATT TATAAAAGAC 

CGGACGTGAT ATGCAACATG 

ATGCATTGAA TATACACAAA 

AGAAATAGAA AATATATTCA 



AAATTTTTAT TATACCTAAT 
GTTTTCTATC TAAAAAATCT 
ATAAAAATAC TACTAATACT 
ACGTTACATC ATCAAATAAA 
ATATGTCAGA TGATCCTGGT 
TAATACAAAA TGACTCGCAT 
CTCCACAACA TGATCCAATT 
AAACTCCTGC TATTCCTTCA 
AAGAGTATGA GCAAACATCT 
CTAATCCTGA AAACAAATTA 
TATGAAACTT GTTATTCTCT 
TTATGCCTAA ATTACAAGAA 
CTGCTAAAGG CATGCTAGAT 
TATACAATGG CAATTCATAC 
CAAAAAATTT AGCATACAGA 
AAGCTATTGA TTATCTTCAA 
AGCTTTAA 



f 28-2 .aa 
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KGNINIMRLC LIKIFIIPNL VFSSLFLFES CSGFLSKKSI EQFALALKDH QENKNTTNTS 
VDKNSKEIES PKDVTSSNKK TYDPILQVGS NQHMSDDPGA NNKESLPNSS PAIIQNDSHA 
QNNVKMEENK SATPQHDPIE QSNFKNSLTT TSKTPAIPSE EEIKANLDEF AQEEYEQTSL 
SEIKNATQIV NHANPENKLN NTLLEFEKDY ETLSNLLFSN LDASPLNRKI KTIMPKLQEM 
RSFMEQATNS WVSAKGMLDE AKDKLAESIY KRLYNGNSYR FGGSFNGRDM QHAKNLAYRA 
IDFASACIEY TQKAIDYLQQ GNSCKKEIEN IFKL 

t28-2 .aa 

KDHQENKNTTNTSVDKNSKEIESPKDVTSSNKKTYDPILQVGSNQHMSDDPGANNKESLPNSSPAIIQNDSHAQN^ 
VKMEENKSATPQHDPIEQSNFKNSLTTTSKTPAIPSEEEIKANLDEFAQEEYEQTSLSEIKNATQIVNHANPENKL 
NNTLLEFEKDYETLSNLLFSNLDASPLNRKIKTIMPKLQEMRSFMEQATNSWSAKGMLDEAKDKIAESIYKRL 
GNSYRFGGSFNGRDMQHAKNLAYRAIDFASACIEYTQKAIDYLQQGNSCKKEIENIFK 

f28-3.nt 

TAGATGAATT TAATTGCTAA ATTATTTATT TTATCCACTT TAGTTTCAAT TCCAAATATC 

CTCTCTTGTA ACCTATATGA TAATCTTGCA GACAACGCTG AGCAGGTTAC AGACATACTA 

GACAACAACA AGTCTTTTAA TACTTTAGGA AGCAGCAATG AGAGTAGAAG TCGCAGGCCT 

AGAAGTACAA ATAATGCTTA TATGAAACAA AACATAGACA AAAATCATTT AGTTGTTGCA 

GATATGCAAA ATGATAATAG TAGCAGCAGT CTTCCCCAAC AAGTTAATAG TGAATCCAGT 

AAAGCTAATG AAGATAGTAA TATTATGAAG GAAATTGAAT CTTCTACAGA AGAGTGCGCT 

AGACTAAGAA AAGATTTAGA AACTATAAAA CAAATACTTG ATAATATAGA AAGCTTGCTT 

AATACAGCTA ATTCTTATTT AGAGAACGCT AGAT^AAGCAC CTAAATCTAA TCAAGATAAT 

CAAACCTTAT TGCTTAGCCT GCACCAAGCT ATTGCTAAGG TTAAGAGTAG TCATACTTCT 

TTTATCATTT GTTATAATGA TGCATTTAAT TCCCTGGGAA TAGCTGATAC TGCCTTTAAA 

GATGCAAAGA GAAAGGCAGT TGAGGCATAA 

t28-3 .nt 

TTGTAACCTATATGATAATCTTGCAGACAACGCTGAGCAGGTTACAGACATACTAGACAACAACAAGTCTTTTAAT 
ACTTTAGGAAGCAGCAATGAGAGTAGAAGTCGCAGGCCTAGAAGTACAAATAATGCTTATATGAAACAAAACATAG 
ACAAAAATCATTTAGTTGTTGCAGATATGCAAAATGATAATAGTAGCAGCAGTCTTCCCCAACAAGTTAATAGTGA 
ATCCAGTAAAGCTAATGAAGATAGTAATATTATGAAGGAAATTGAATCTTCTACAGAAGAGTGCGCTAGACTAAGA 
AAAGATTTAGAAACTATAAAACAAATACTTGATAATATAGAAAGCTTGCTTAATACAGCTAATTCTTATTTAGAGA 
ACGCTAGAAAAGCACCTAAATCTAATCAAGATAATCAAACCTTATTGCTTAGCCTGCACCAAGCTATTGCTAAGGT 
TAAGAGTAGTCATACTTCTTTTATCATTTGTTATAATGATGCATTTAATTCCCTGGGAATAGCTGATACTGCCTTT 
AAAGATGCAAAGAGAAAGGCAGTTGAGGCA 

f28-3 .aa 

MNLIAKLFIL STLVSIPNIL SCNLYDNLAD NAEQVTDILD NNKSFNTLGS SNESRSRRPR 
STNNAYMKQN IDKNHLWAD MQNDNSSSSL PQQVNSESSK ANEDSNIMKE lESSTEECAR 
LRKDLETIKQ ILDNIESLLN TANSYLENAR KAPKSNQDNQ TLLLSLHQAI AKVKSSHTSF 
IICYNDAFNS LGIADTAFKD AKRKAVEA 

t28-3 .aa 

C^^^YDNLADNAEQVTDILDN1^SFIOTLGSSNESRSRRPRSTNNAYMKQNIDK^ 

SSKANEDSNIMKEIESSTEECARLRKIDLETIKQILDNIESLLNTANSYLENARKAPKSNQDNQTLLLSLHQAIAKV 
KSSHTSFI ICYNDAFNSLGIADTAFKDAKRKAVEA 

f31-2.nt 

TAAAAAAATA AGGAGGTATT AATGAAAAGG T^AAAGCAATA TATGTATTTC ACTTCTAGTC 
ACAATATTAT TTGTGTCTTG CAAGTTTTTT GGAAATAAAA GCGCAAGTAA AGAA/^AAGAA 
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GAAACTTCTT TTTCTGATAC TGCTAGCAAG ATTAGTAAGT CGGGAACAGC TGCTTCTTCA 
GACAAACAAG AAAAAAATAC AAGTGATGTT ACAGGTGACG CCAAAAAGCA TACTAGTAGC 
CCTTACATGC TTGCTGATGC CCTTATTGTT AGTGATACTA CTAATAGAGA TAGAGATAAG 
CAAGAAAATA AAGATAAATT AAATGAAGAA GATAAAAAAA AGCTTAATGC TTTTTTTAGC 
ACAACTAAAA CATATCAATC TAGCCTAGAT TCCATTTATA ACAAATATAC AGGCTATTAT 
AATACCATTG ATACCTATGG CAGCTGTGAT ACGTATCGCA TTGAGTGTTT TAGTGTAGGA 
CCTTCTGAAA AACGTAAACA AGCTCTTGCT GATCTAGAGA AGTTAAAACT AGACGAAAAG 
TACACTCAGC TTAGCACAAT GTTAAAGAGT GCTGTGCCTA GTTATTACAA AAAAAATTTA 
GATGATTCTA TTGCACAGTA TAAGGAAGCC ATAAAGCAGG CTATTGAAGC TGAAAGTAAA 
ATAGAGACAG TAAAAGACTA TGCAACAGCT CAAAGTGCTG CCGATGACGA AAAGAAAAGA 
T^TATAGATA ATTTAAAAAT AGTTAGAGAT GTTCTTCTTA TTATTAAAAA AACTATTGAG 
AAAGCCAGCC GATCTTATGC TGATGCTTTT GCTATTGCAA CATCTAGCTT ATCTTGTAGC 
GAATTTAAGC AAGCTGTTAA AGAGTTTAAT GATGCTGCTA AACAATATGC TAATGGAAAT 
AAAGGAGACA ATGCTGTCAA TGTTATTGTA GGCACTATTT CTAGTATGCC TTATGTCAAA 
TTTAAAGATG AGTTTGCAAG AGCAAAAATG TTTGCTCGTA ATTATAGAGG AGACGAGGTA 
GACAAGATGA TAAGAGCTAT CGACAAGCTG TGTGATGTTT ATAAAAAAGT TGCGCTTTAG 

t31-2 .nt 

TTGCAAGTTTTTTGGAAATAAAAGCGCAAGTAAAGAAAAAGAAGAAACTTCTTTTTCTGATACTGCTAGCAAGATT 
AGTAAGTCGGGAACAGCTGCTTCTTCAGACAAACAAGAAAAAAATACAAGTGATGTTACAGGTGACGCCAAAAAGC 
ATACTAGTAGCCCTTACATGCTTGCTGATGCCCTTATTGTTAGTGATACTACTAATAGAGATAGAGATAAGCAAGA 
AAATAAAGATAAATTAAATGAAGAAGATAAAAAAAAGCTTAATGCTTTTTTTAGCACAACTAAAACATATCAATCT 
AGCCTAGATTCCATTTATAACAAATATACAGGCTATTATAATACCATTGATACCTATGGCAGCTGTGATACGTATC 
GCATTGAGTGTTTTAGTGTAGGACCTTCTGAAAAACGTAAACAAGCTCTTGCTGATCTAGAGAAGTTAAAACTAGA 
CGAAAAGTACACTCAGCTTAGCACAATGTTAAAGAGTGCTGTGCCTAGTTATTACAAAAAAAATTTAGATGATTCT 
ATTGCACAGTATAAGGAAGCCATAAAGCAGGCTATTGAAGCTGAAAGTAAAATAGAGACAGTAAAAGACTATGCAA 
CAGCTCAAAGTGCTGCCGATGACGAAAAGAAAAGAAATATAGATAATTTAAAAATAGTTAGAGATGTTCTTCTTAT 
TATTAAAAAAACTATTGAGAAAGCCAGCCGATCTTATGCTGATGCTTTTGCTATTGCAACATCTAGCTTATCTTGT 
AGCGAATTTAAGCAAGCTGTTAAAGAGTTTAATGATGCTGCTAAACAATATGCTAATGGAAATAAAGGAGACAATG 
CTGTCAATGTTATTGTAGGCACTATTTCTAGTATGCCTTATGTCAAATTTAAAGATGAGTTTGCAAGAGCAAAAAT 
GTTTGCTCGTAATTATAGAGGAGACGAGGTAGACAAGATGATAAGAGCTATCGACAAG 

f31-2 .aa 

KNKEVLMKRK SNICISLLVT XLFVSCKFFG NKSASKEKEE TSFSDTASKI SKSGTAASSD 

KQEKNTSDVT GDAKKHTSSP YMLADALIVS DTTNRDRDKQ ENKDKLNEED KKKLNAFFST 

TKTYQSSLDS lYNKYTGYYN TIDTYGSCDT YRIECFSVGP SEKRKQALAD LEKLKLDEKY 

TQLSTMLKSA VPSYYKKNLD DSIAQYKEAI KQAIEAESKI ETVKDYATAQ SAADDEKKRN 

IDNLKIVRDV LLIIKKTIEK ASRSYADAFA lATSSLSCSE FKQAVKEFND AAKQYANGNK 

GDNAVNVIVG TISSMPYVKF KDEFARAKMF ARNYRGDEVD KMIRAIDKLC DVYKKVAL 

t31-2 .aa 

CKFFGNKSASKEKEETSFSDTASKISKSGTAASSDKQEKNTSDVTGDAKKHTSSPYMLADALIVSDTTNRDRDKQE 

NKDKIJSTEEDKKKLNAFFSTTKTYQSSLDSIYNKYTGYYNTIOTYGSCOTYRIECFSVGPSEKJ^KQAL^ 

EKYTQLSTMLKSAVPSYYKKNl^DSIAQYKEAIKQAIEAESKIETVKDYATAQSAADDEKKRNIDNLKIVRDVLLI 

IKKTIEKASRSYADAFAIATSSLSCSEFKQAVKEFNDAAKQYANGNKGDNAVIWIVGTISSMPYVKFKDEFARAKM 

FARNYRGDEVDKMIRAIDK 

£32-4. nt 

TAAGGAAATA TGAGGAATAT TAGCAATTGT ATCAAATATA TTATATTAAC AATGCTTATT 
GGATTATTAA TTTTTTGTTG TGCAACCTTT GTTTGGTTGA TTGGAATTTT TTATTCAAAT 
AACTTTAAAG AAGAGCGGAA TTATTCAATA AGCCCAATAG ATAGTGTTAT TATGCGTAAA 
TGTTATTTTA AAGAATTTAA GTCTGGACTT ATTAAAAGCG TATTCTTTAA GAAATTAGAT 
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GTAAATGTTA ACTCTAAAAA TTTTAAGGAG CTAAATAAGG TAGATAAACA AAATCTGCTA 
AATTCTTATC CATCTTATCA TATGGAGTTT GTCGTAGTTG ATAATGGATT TTTAATGAAT 
TTTAAAAATG TTATTTTTAA TGGTATAGAT GATGCTAAAT TATACGATCA ACGTGATATG 
GTTTACGGAG GATTTAGATA CTCAAAAGAG GCTTATTTCC AAATTATTGG CAATTATGAT 
GTTAAATTAA ATAAAATGAA ACAATATACT CCAGCAATTG TAGTAAATGT TTTCAAAATT 
AACATTAATG ATGCTTTATT TAACTCGTTA TTAAAGCAAA AAACTTTAAA AGTTACTTTG 
ATTTCCCATA ATAATAAAGA GTATATTTTA CAAACTAATA ATTTCTTATC AAAGTATAAT 
TTTCAAACAC CAGAAAAGGA GAATAGTTCT TACTAA 

t32-4.nt 

AAATAACTTTAAAGAAGAGCGGAATTATTCAATAAGCCCAATAGATAGTGTTATTATGCGTAAATGTTATTTTAAA 
GAATTTAAGTCTGGACTTATTAAAAGCGTATTCTTTAAGAAATTAGATGTAAATGTTAACTCTAAAAATTTTAAGG 
AGCTAAATAAGGTAGATAAACAAAATCTGCTAAATTCTTATCCATCTTATCATATGGAGTTTGTCGTAGTTGATAA 
TGGATTTTTAATGAATTTTAAAftATGTTATTTTTAATGGTATAGATGATGCTAAATTATACGATCAACGTGATATG 
GTTTACGGAGGATTTAGATACTCAAAAGAGGCTTATTTCCAAATTATTGGCAATTATGATGTTAAATTAAATAAAA 
TGAAACAATATACTCCAGCAATTGTAGTAAATGTTTTCAAAATTAACATTAATGATGCTTTATTTAACTCGTTATT 
AAAGCAAAAAACTTTAAAAGTTACTTTGATTTCCCATAATAATAAAGAGTATATTTTACAAACTAATAATTTCTTA 
TCAAAGTATAATTTTCAAACACCAGAAAAGGAGAATAGTTCTTAC 

f32-4 .aa 

GNMRNISNCI KYIILTMLIG LLIFCCATFV WLIGIFYSNN FKEERNYSIS PIDSVIMRKC 

YFKEFKSGLI KSVFFKKLDV NVNSKNFKEL NKVDKQNLLN SYPSYHMEFV WDNGFLMNF 

KNVIFNGIDD AKLYDQRDMV YGGFRYSKEA YFQIIGNYDV KLNKMKQYTP AIWNVFKIN 

INDALFNSLL KQKTLKVTLI SHISTNKEYILQ TNNFLSKYNF QTPEKENSSY 

t32-4 .aa 

NNFKEERJSTiTSISPIDSVIMRKCYFKEFKSGLIKSVFFKKLDVlNrVNSKNFKELNKV^ 
GFLMNFKNVIFNGIDDAKLYDQRDlWYGGFRYSKEAYFQIIGNYDVKLJSrKMKQYTPAIVVN^ 
KQKTLKVTLISHNNKEYILQTNNFLSKYNFQTPEKENSSY 

f4-15.nt 

TAAATGAGCA AAAAAGTAAT TTTAATATTA CTAGAAATTT 
TCTATAAATA AAGAACAAAA AACCAAAGAA AAAACATCTG 
CAAAATATTG AAAAACAAGA GCCTGAAAAA CAGAAACAAA 
ACGGTATCAA TTCAT^CGGT AGAAATAAGG GAATCAAATC 
AAGTACTACA AGCAAGCTTA TCCGATTCAA ACATTCACTC 
GAAAAGGAAT TTCTAAAACC AGAAGATAAA ATCTTGCCCA 
TTGAGCATCT TAATAAATAA AAAATTGTTA GACTTTAAAG 
TCAACTTTAA AAAATTTCAA AGAAATTAAA AATATTGAGA 
TTATTATTTG TCTTAACCCT TAAAGATAAA AATAACAACA 
AATCCCCCAA ACGACATCCA AAAACCCAAA GATTATATTT 
ATTAAAAAGG GTACTGGTGA GAAATACTTA AATCCTATCT 
AAAAAAGATT ATCATTCAAT AGATTACAAC AAAGTGACTA 
TTGGACCTAC TGCCTCACGA ACAAGTCTTT CAAATGAATA 
GACACAATAA CAGACTTAAA TAATCTAAAA TTAGTAATTC 

t4-15.nt 

TTGTGATTTATCTATAAATAAAGAACAAAAAACCAAAGAAAAAACATCTGAAAAGCAAGAATCTGAAAAACA 
ATTGAAAAACAAGAGCCTGAAAAACAGAAACAAAATGCAGCAAAAATAATCCCTACGGTATCAATTCAAACGGTAG 
AAATAAGGGAATCAAATCAAATTCCAAAAAGCATTGAGAAGTACTACAAGCAAGCTTATCCGATTCAAACA 
TCTTGATTTTAGCATCACAAGAGAAAAGGAATTTCTAAAACCAGAAGATAAAATCTTGCCCACACAGG^ 



TGATCTTGTC 
AAAAGCAAGA 
ATGCAGCAAA 
AAATTCCAAA 
TTGATTTTAG 
CACAGGGGAA 
CCCCAGAAAA 
ATTTCTTCCA 
ACACTATTAA 
TAAAAGACCT 
ATAGATTTCA 
TTAGCGAAAA 
AAAATTTCAC 
AAAAAGAATT 



TTGTGATTTA 
ATCTGAAAAA 
AATAATCCCT 
AAGCATTGAG 
CATCACAAGA 
AGTGGAGTCT 
TCCAAAAAGC 
AAATCAAGAC 
CATCATGCTC 
TAAAGACACA 
AATAAAAAAC 
AACAATAGAA 
TAAAATTTTA 
AGTGTAA 
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GAGTCTTTGAGCATCTTAATAAATAAAAAATTGTTAGACTTTAAAGCCCCAGAAAATCCAAAAAGCTCAACTTTAA 
AAAATTTCAAAGAAATTAAAAATATTGAGAATTTCTTCCAAAATCAAGACTTATTATTTGTCTTAACCCTTAAAGA 
TAAAAATAACAACAACACTATTAACATCATGCTCAATCCCCCAAACGACATCCAAAAACCCAAAGATTATATTTTA 
AAAGACCTTAAAGACACAATTAAAAAGGGTACTGGTGAGAAATACTTAAATCCTATCTATAGATTTCAAATAAAAA 
ACAAAAAAGATTATCATTCAATAGATTACAACAAAGTGACTATTAGCGAAAAAACAATAGAATTGGACCTACTGCC 
TCACGAACAAGTCTTTCAAATGAATAAAAATTTCACTAAA 

£4-15. aa 



MSKKVILILL EILILSCDLS INKEQKTKEK 
VSIQTVEIRE SNQIPKSIEK YYKQAYPIQT 
SILINKKLLD FKAPENPKSS TLKNFKEIKN 
PPNDIQKPKD YILKDLKDTI KKGTGEKYLN 
DLLPHEQVFQ MNKNFTKILD TITDUMNLKL 



TSEKQESEKQ NIEKQEPEKQ KQNAAKIIPT 
FTLDFSITRE KEFLKPEDKI LPTQGKVESL 
lENFFQNQDL LFVLTLKDKN NNNTINIMLN 
PIYRFQIKNK KDYHSIDYNK VTISEKTIEL 
VIQKELV 



t4-15 .aa 



CDLSINKEQKTKEKTSEKQESEKQNIEKQEPEKQKQNAAKIIPTVSIQTVEIRESNQIPKSIEKYYKQAYPIQTFT 
LDFSITREKEFLKPEDKILPTQGKVESLSILINKKLLDFKAPENPKSSTLKNFKEIKNIENFFQNQDLLFVLTLKD 
KNlSn^TINIMLNPPNDIQKPKDYILKDLKDTIKKGTGEKYLNPIYRFQIKNKKDYHSIDYNKVTIS 
HEQVFQMNKNFTK 



f4-50.nt 



TAGAAGGAGG AAAAAATGAA 
AAACTATTGG TCGCATGTAG 
TCCTCTAAGG ATTTAAAAAA 
GTACTTTTTG AAGCTTTTAC 
GCCTTAAGAG AAGCAAAAGT 
GAAGAAGAAG CTTTAAAGCT 
GACTTAATGC TTGAGGTTGT 
CGTGTTTTAG AGGAATCTAA 
AAAGCTCAAA TAGAAAATCA 
GGAGAGAAAA AAAATAATAA 



AATTGGAAAG CTAAATTCAA 
TATTGGATTA GTAGAAAGAA 
CAAAATTTTA AAAATAAAAA 
AGGTCTTAAA ACCGGTTCCA 
ACAAGCCATT GTTGAAACAG 
TAAAGAAACT GGAAACAGTG 
AGAATCGCTA GAAGACGTTG 
AAATAATCCT ATAAACACAG 
ACTTAAAGTG GTTAAGGAAA 
AAGCAAAAAA AAGAAATAA 



TAGTTATAGC CTTGTTTTTT 
CAAATGCAGC TCTTGAATCG 
AAGAAGCCAC GGGAAAAGGT 
AGGTAACAAG TGGTGGACTA 
GAAAGTTCCT TAAGATAATA 
GTCAATTCTT GGCTATGTTT 
GAATAATAGG CTTAAAAGCC 
CTGAAAGATT GCTTGCGGCT 
AACAAAATAT TGAAAATGGT 



t4-50 .nt 



ATGTAGTATTGGATTAGTAGAAAGAACAAATGCAGCTCTTGAATCGTCCTCTAAGGATTTAAAAAACAAAATTTTA 
AAAATAAAAAAAGAAGCCACGGGAAAAGGTGTACTTTTTGAAGCTTTTACAGGTCTTAAAACCGGTTCCAAGGTAA 
CAAGTGGTGGACTAGCCTTAAGAGAAGCAAAAGTACAAGCCATTGTTGAAACAGGAAAGTTCCTTAAGATAATAGA 
AGAAGAAGCTTTAAAGCTTAAAGAAACTGGAAACAGTGGTCAATTCTTGGCTATGTTTGACTTAATGCTTGAGGTT 
GTAGAATCGCTAGAAGACGTTGGAATAATAGGCTTAAAAGCCCGTGTTTTAGAGGAATCTAAAAATAATCCTATAA 
ACACAGCTGAAAGATTGCTTGCGGCTAAAGCTCAAATAGAAAATCAACTTAAAGTGGTTAAGGAAAAACAAAATAT 
TGAAAATGGTGGAGAGAAAAAAAATAATAAAAGCAAAAAAAAGAAA 



f4-50.aa 



KEEKMKIGKL NSIVIALFFK 

LFEAFTGLKT GSKVTSGGLA 

LMLEWESLE DVGIIGLKAR 

EKKNNKSKKK K 



LLVACSIGLV ERTNAALESS 
LREAKVQAIV ETGKFLKIIE 
VLEESKNNPI NTAERLLAAK 



SKDLKNKILK IKKEATGKGV 
EEALKLKETG NSGQFLAMFD 
AQIENQLKW KEKQNIENGG 



t4-50 .aa 
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CSIGLVERTNAALESSSKDLKNKILKIKKEATGKGVLFEAFTGLKTGSKVTSGGLALREAKVQAIVETGK 

EEALKLKETGNSGQFLAMFDLMLEVVESLEDVGIIGLKARVLEESKNNPINT^ 

ENGGEKKNNKSKKKK 

£4-66. nt 

TAATTTTTAA AATTTAAATA TTTACATAAT AGTAATGTGT GTGGGAGACG TATGAAAAAT 
ATTTTATTAT TTGTTATTTT ATTATTCTTT TCTTGTAAAG AATTTAATTA TTCTGATCTT 
AGGAGAAGGC CTTCAAAGGT TTTAAATGCT TCTAATGGTG CATCAAATAA AGAACTTAAA 
ATTTCTTTTG TAGATTCTTT AAATGATGAT CAAAAAGAAG CTTTGTTTTT TCTTGAACAG 
GTAGTTCTTG ATAGCAATCC CGACAAGTTT AATCAAATTT TTAATTTAAA TGAAGAGAAG 
GTAAAAGAAA TGCTTGTTAC TGTTGTTAAG TGTTTAAAGG CCAAAAGAAA GGCTAAAATG 
GCTCTTGAGA GCTCAAATGT TGCAAATGTT GCCAATGCTA AACAGCAATT GCTACAGGTT 
GAAAAAACTT ACATAGATAA TTTGCGACAA TCTTTTATGA CTACTAAAAA CATTGAAGAG 
GCTTGTAATC TTGTAAAAAA TTATGATGCA TCTGCTTCGT TTTAA 

t4-66 .nt 

TTGTAAAGAATTTAATTATTCTGATCTTAGGAGAAGGCCTTCAAAGGTTTTAAATGCTTCTAATGGTGCATCAAAT 
AAAGAACTTAAAATTTCTTTTGTAGATTCTTTAAATGATGATCAAAAAGAAGCTTTGTTTTTTCTTGAACAGGTAG 
TTCTTGATAGCAATCCCGACAAGTTTAATCAAATTTTTAATTTAAATGAAGAGAAGGTAAAAGAAATGCTTGTTAC 
TGTTGTTAAGTGTTTAAAGGCCAAAAGAAAGGCTAAAATGGCTCTTGAGAGCTCAAATGTTGCAAATGTTGCCAAT 
GCTAAACAGCAATTGCTACAGGTTGAAAAAACTTACATAGATAATTTGCGACAATCTTTTATGACTACTAAAAACA 
TTGAAGAGGCTTGTAATCTTGTAAAAAATTATGATGCATCTGCTTCGTTT 

£4-66. aa 

FLKFKYLHNS NVCGRRMKNI LLFVILLFFS CKEFNYSDLR RRPSKVLNAS NGASNKELKI 
SFVDSLNDDQ KEALFFLEQV VLDSNPDKFN QIFNLNEEKV KEMLVTWKC LKAKRKAKMA 
LESSNVANVA NAKQQLLQVE KTYIDNLRQS FMTTKNIEEA CNLVKNYDAS ASF 

t4-66 .aa 

CKEFNYSDLRRRPSKVLNASNGASNKELKISFVDSLNDDQKEALFFLEQWLDSNPDKFNQIFNLNEEKVKEMLVT 
VVKCLKAKRKAKMALESSWAWANAKQQLLQVEKTYIDNLRQSFMTTKNIEEACNLVK^ 

£42-1. nt 

TAATTATTAA AATCTAAGGA GAAGAGATTT ATGAACAAAA AATTTTCTAT TTCATTATTA 
TCTACAATAT TAGCCTTCTT GTTAGTATTA GGTTGTGATT TGTCAAGCAA TAATGCTGAA 
AACAAAATGG ATGATATTTT TAATTTAGAA AAGAAATACA TGGATAATTC AAATTATAAA 
TGTTTAAGTA AAAATGAGGC TATAGTTAAA AATTCTAAAA TTAAATTAGG TGTAAATAAT 
ACTAGAAGTC GTTCTTATTC TTCTAGAGAG ACTAATGTTT CGGATTCCTA TAATAAAACC 
TATTCATATT GCAAAAGCAA CTGA 

t42-l .nt 

TTGTGATTTGTCAAGCAATAATGCTGAAAACAAAATGGATGATATTTTTAATTTAGAAAAGAAATACATGGATAAT 
TCAAATTATAAATGTTTAAGTAAAAATGAGGCTATAGTTAAAAATTCTAAAATTAAATTAGGTGTAAATAATACTA 
GAAGTCGTTCTTATTCTTCTAGAGAGACTAATGTTTCGGATTCCTATAATAAAACCTATTCATATTGCAAAAGCAA 
C 

£42-1. aa 

LLKSKEKRFM NKKFSISLLS TILAFLLVtX3 CDLSSNNAEN KMDDIFNLEK KYMDNSNYKC 
LSKNEAIVKN SKIKLGVNNT RSRSYSSRET NVSDSYNKTY SYCKSN 



wo 98/59071 

199 

TABLE 1. Nucleotide and Amino Acid Sequences 
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CDLSSNNAENKMDDIFNLEKKYMDNSl^n"KCLSKWEAIVKNSKIKLGVm 
f43-3.nt 

TGAATATTAA TAATAAAAAA AGGAATAANA ATGAAAATTA TCAACATATT ATTTTGTTTA 
TTTTTACTAA TGCTAAACAG CTGTAATTCT AATGATACTA ATACTAGCCA AACAAAAAGT 
AGACAAAAAC GTGATTTAAC CCAAAAAGAA GCAACACAAG AAAAACCAAA ATCTAAAGAA 
GACCTGCTTA GAGAAAAGCT ATCTGAAGAC CAAAAAACAC ATCTTGACTG GTTAAAAACC 
GCTTTAACTG GTGCTGGAGA ATTTGATAAA TTTTTAGGAT ATGACGAAGA CAAAATAAAA 
GGTGCACTTA ATCATATAAA GAGTGAACTT GATAAGTGTA CTGGGGATAA TTCTGAACAA 
CAAAAAAGCA CCTTCAAAGA GGTGGTTAAG GGGGCTCTTG GTGGCGGTAT AGATAGTTTT 
GCAACTAGTG CAAGTAGTAC CTGCCAAGCT CAGCAATAA 

t43-3 .nt 

CTGTAATTCTAATGATACTAATACTAGCCAAACAAAAAGTAGACAAAAACGTGATTTAACCCAAAAAGAAGCAACA 
CAAGAAAAACCAAAATCTAAAGAAGACCTGCTTAGAGAAAAGCTATCTGAAGACCAAAAAACACATCTTGACTGGT 
TAAAAACCGCTTTAACTGGTGCTGGAGAATTTGATAAATTTTTAGGATATGACGAAGACAAAATAAAAGGTGCACT 
TAATCATATAAAGAGTGAACTTGATAAGTGTACTGGGGATAATTCTGAACAACAAAAAAGCACCTTCAAAGAGGTG 
GTTAAGGGGGCTCTTGGTGGCGGTATAGATAGTTTTGCAACTAGTGCAAGTAGTACCTGCCAAGCTCAGCAA 

f43-3 .aa 

ILIIKKGIXM KIINILFCLF LLMLNSCNSN DTNTSQTKSR QKRDLTQKEA TQEKPKSKED 
LLREKLSEDQ KTHLDWLKTA LTGAGEFDKF LGYDEDKIKG ALNHIKSELD KCTGDNSEQQ 
KSTFKEWKG ALGGGIDSFA TSASSTCQAQ Q 

t43-3 .aa 

CNSNDTNTSQTKSRQKRDLTQKEATQEKPKSKEDLLREKLSEDQKTHLDWLKTALTGAGEFDKFLGYDEDKIKGAL 
NHIKSELDKCTGDNSEQQKSTFKEWKGALGGGIDSFATSASSTCQAQQ 

£45-2 .nt 

TAGGAGAGAA TAATTATGAA 
ATTTCTTGCA AGAATTTTGC 
AAAGGATTTG TAAATAAGAT 
AAAGTAGATG AAGTAGCAAA 
GATGATCCTA ATGGCAGTGG 
AATGCATTAG TATTAAAAGC 
GAAGCTGAAG CTAAAGTTGA 
AAAGAAAAAG AAATAATAGA 
GAACAACAAA AAGAACAAAA 
AAAGAAAAAA GAGAAAGAGA 
AAAAGGCAAG TTGATAACCA 
AATATTGATG TTATAAAATG 
ACTGGGCCTG TGTATGATGA 
GGGTTAGAAG AGGAATCAGA 
GCTAGGGACG CGCTAAGAAC 
GAGCCTAAGT TAAAAGAAAG 
AAATCAAAAT TAGAAGAAGT 
AAAGGATACA TCAGTGACAG 

t45-2 .nt 




PCT/US98/12718 



TAAAAAAACA TTGATTATTT 
AACTGGTAAA GATATAAAAC 
TTTAGATCCA GTAAAGGATA 
AAAATTACAA GAAGAAGAAA 
AATAAATCCG CCACCAGTAT 
AATAGAACAA AGTGATGGTC 
AGAAAATAAA GAAAAACAAG 
CGAACAAAAC AAACAAGAAT 
AAGACATCAA GAAGAGCAAC 
AGAGGCAGAA CAACAAAAAC 
AATTAAAACA CTTATAGCTA 
GCAAACGACT GTAGGCCCAC 
TTTTACCAAT GGCAATAATT 
AGACGAAGGA TTAGGAAAAT 
TAAATTAAAT GAAGGCAATA 
TGTAAATGTT AGCGAAATTA 
TAAAAAATAT CTTAAAGATA 
TCAGTAA 



GTGCTGTTTT TGCGCTGATA 
AAAATTCAGA AGGGAAAATT 
AAATTGCTTC AAGTGGTACA 
AAGAAGAATT AATGCAGGGC 
TGCCGGAAAA TATTCACAAT 
AACAAGAAAA AAAAGTAGAA 
AGAATACAGA AGAAAACATT 
TAGCTAAAGC TAAAGAAGAA 
AAAGAAAAGC TAAAGCAGAA 
GACAACAAGA AGAGGAAGAA 
AAATAGATGA GATCAATGAA 
AAGGCGTTAT AGATAGAATT 
CTATACGCGA AACTTGGGAG 
TATTGAAAGA ATTGAGTGAT 
AACCATATAC TGGTTACGAA 
AAGAAGATTT AGAAAAATTA 
GTTCTAAATT TGAAGAAATT 
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TTGCAAGAATTTTGCAACTGGTAAAGATATAAAACAAAATTCAGAAGGGAAAATTAAAGGATTTGTAAATAAGATT 
TTAGATCCAGTAAAGGATAAAATTGCTTCAAGTGGTACAAAAGTAGATGAAGTAGCAAAAAAATTACAAGAAGAAG 
AAAAAGAAGAATTAATGCAGGGCGATGATCCTAATGGCAGTGGAATAAATCCGCCACCAGTATTGCCGGAAAATAT 
TCACAATAATGCATTAGTATTAAAAGCAATAGAACAAAGTGATGGTCAACAAGAAAAAAAAGTAGAAGAAGCTGAA 
GCTAAAGTTGAAGAAAATAAAGAAAAACAAGAGAATACAGAAGAAAACATTAAAGAAAAAGAAATAATAGACGAAC 
AAAACAAACAAGAATTAGCTAAAGCTAAAGAAGAAGAACAACAAAAAGAACAAAAAAGACATCAAGAAGAGCAACA 
AAGAAAAGCTAAAGCAGAAAAAGAAAAAAGAGAAAGAGAAGAGGCAGAACAACAAAAACGACAACAAGAAGAGGAA 
GAAAAAAGGCAAGTTGATAACCAAATTAAAACACTTATAGCTAAAATAGATGAGATCAATGAAAATATTGATGTTA 
TAAAATGGCAAACGACTGTAGGCCCACAAGGCGTTATAGATAGAATTACTGGGCCTGTGTATGATGATTTTACCAA 
TGGCAATAATTCTATACGCGAAACTTGGGAGGGGTTAGAAGAGGAATCAGAAGACGAAGGATTAGGAAAATTATTG 
AAAGAATTGAGTGATGCTAGGGACGCGCTAAGAACTAAATTAAATGAAGGCAATAAACCATATACTGGTTACGAAG 
AGCCTAAGTTAAAAGAAAGTGTAAATGTTAGCGAAATTAAAGAAGATTTAGAAAAATTAAAATCAAAATTAGAAGA 
AGTTAAAAAATATCTTAAAGATAGTTCTAAATTTGAAGAAATTAAAGGATACATCAGTGACAGTCAG 

f45-2 .aa 

ERIIMNKKTL IICAVFALII SCKNFATGKD IKQNSEGKIK GFVNKILDPV KDKIASSGTK 
VDEVAKKLQE EEKEELMQGD DPNGSGINPP PVLPENIHNN ALVLKAIEQS DGQQEKKVEE 
AEAKVEENKE KQENTEENIK EKEIIDEQNK QELAKAKEEE QQKEQKRHQE EQQRKAKAEK 
EKREREEAEQ QKRQQEEEEK RQVDNQIKTL lAKIDEINEN IDVIKWQTTV GPQGVIDRIT 
GPVYDDFTNG NNSIRETWEG LEEESEDEGL GKLLKELSDA RDALRTKLNE GNKPYTGYEE 
PKLKESVNVS EIKEDLEKLK SKLEEVKKYL KDSSKFEEIK GYISDSQ 

t45-2 .aa 

CKNFATGKDIKQNSEGKIKGFVNKILDPVKDKIASSGTKVDEVAKKLQEEEKEELMQGDDPNGSGINPPPVLPENI 
HNNALVLKAIEQSDGQQEKKVEEAEAKVEENKEKQENTEENIKEKEIIDEQNKQELAKAKEEEQQKEQKRHQEEQQ 
RKAKAEKEKREREEAEQQKRQQEEEEKRQVDNQIKTLIAKIDEINENIDVIKWQTTVGPQGVIDRITGPVYDDFTN 
GNNSIRETWEGLEEESEDEGLGKLLKELSDARDALRTKLNEGNKPYTGYEEPKLKESVNVSEIKEDLEKLKSKLEE 
VKKYLKDSSKFEEIKGYISDSQ 

f47-2.nt 

TGAATATTAA TAATAAAAAA AGGAGTAACA ATGAAAATCA TCAACATATT ATTTTGTATA 
TCTTTGCTAC TACTAAATAG CTGTAATTCC AATGATAATG ACACTTTAAA AAACAATGCC 
CAACAAACAA AAAGCAGGAA AAAACGTGAT TTAAGCCAAG AAGAACTGCC ACAACAAGAA 
AAAATCACTT TAACATCCGA CGAAGAAAAA ATGTTTACTT CATTAATCAA TGTGTTTAAA 
TACACAATTG AAAAATTAAA CAATGAAATA CAAGGGTGCA TGAATGGAAA CAAAAGTAAA 
TGTAATGACT TCTTTGATTG GCTTTCTGAA GATATTCAAA AACAAAAAGA ATTAGCTGGT 
GCTTTTACCA AGGTTTACAA CTTCTTAAAA TCAAAAGCAC AAAATGAAAC TTTTGATACT 
TATATTAAAG GAGCTATTGA TTGTAAAAAA AACACTCCAC AAGATTGTAA TAAAAATAAT 
GAAATATGGG GAGGTGGACA ACTTANTAGN GCAATATTTT AG 

t47-2 .nt 

CTGTAATTCCAATGATAATGACACTTTAAAAAACAATGCCCAACAAACAAAAAGCAGGAAAAAACGTGATTTA^ 
CAAGAAGAACTGCCACAACAAGAAT^AAATCACTTTAACATCCGACGAAGAAAAAATGTTTACTTCATTAATCAATG 
TGTTTAAATACACAATTGAAAAATTAAACAATGAAATACAAGGGTGCATGAATGGAi^CAAAAGTA 
CTTCTTTGATTGGCTTTCTGAAGATATT 

CATy^T^CAT^AAAGAATTAGCTGGTGCTTTTACCAAGGTTTACAACTTCTTAAAATCAAAAGCACAAAATG 
TTGATACTTATATTAAAGGAGCTATTGATTGTAAAAAAAACACTCCACAAGATTGTAATAAAAATAATGAA 

f47-2.aa 



ILIIKKGVTM KIINILFCIS LLLLNSCNSN DNDTLKNNAQ QTKSRKKRDL SQEELPQQEK 
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ITLTSDEEKM FTSLIlSrVFKY TIEKLNNEIQ GCMNGNKSKC NDFFDWLSED IQKQKELAGA 
FTKVYNFLKS KAQNETFDTY IKGAIDCKKN TPQDCNKNNE IWGGGQLXXA IF 

t47-2 .aa 

CNSNDNDTLKNNAQQTKSRKKRDLSQEELPQQEKITLTSDEEKMFTSLIWFKYTIEKLNNEIQGCMNGNKSK^ 
FFDWLSEDIQKQKELAGAFTKVYNFLKSKAQNETFDTYIKGAIDCKKNTPQDCNKNNE 

f49-2 .nt 

TAAATGTTCA AAACAATCAT TAAACAAAAA AATATGAAAA AAATTTCAAG TGCAATTTTA 
TTAACAACTT TCTTTGTTTT TATTAATTGT AAAAGCCAAG TTGCTGATAA GGCGAGTGTG 
ACGGGGATTG CTAAGGGAAT AAAGGAGATT GTTGAAGCTG CTGGGGGGAG TGAAAAGCTG 
AAAGTTGCTG CTGCTGAAGG GGAGAATAAT GAAAAGGCAG GGAAGTTGTT TGGGAAGGCT 
GGTGCTGGTA ATGCTGGGGA CAGTGAGGCT GCTAGCAAGG CGGCTGGTGC TGTTAGTGCT 
GTTAGTGGGG AGCAGATATT AAGTGCGATT GTTAAGGCTG CTGGTGAGGC TGCGCAGGAT 
GGAGAGAAGC CTGGGGAGGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTAATGAG 
GATGGTGCGG AGTTTAAGGA TGAGATGAAG AAGGATGATC AGATTGCTGC TGCTATTGCT 
TTGAGGGGGA TGGCTAAGGA TGGAAAGTTT GCTGTGAAGA ATGATGAGAA AGGGAAGGCT 
GAGGGGGCTA TTAAGGGAGC TGGCGAGTTG TTGGATAAGC TGGTAAAAGC TGTAAAGACA 
GCTGAGGGGG CTTCAAGTGG TACTGCTGCA ATTGGAGAAG TTGTGGCTGA TGATAATGCT 
GCGAAGGTTG CTGATAAGGC GAGTGTGAAG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTG GGGGGAGTAA AAAGCTGAAA GTTGCTGCTG CTAAAGAGGG CAATGAAAAG 
GCAGGGAAGT TGTTTGGGAA AGTTGATGCT GCTCATGCTG GGGACAGTGA GGCTGCTAGC 
AAGGCGGCTG GTGCTGTTAG TGCTGTTAGT GGGGAGCAGA TATTAAGTGC GATTGTTAAG 
GCTGCTGGTG CGGCTGCTGG TGATCAGGAG GGAAAGAAGC CTGGGGATGC TAAAAATCCG 
ATTGCTGCTG CTATTGGGAA GGGTGATGCG GAGAATGGTG CGGAGTTTAA TCATGATGGG 
ATGAAGAAGG ATGATCAGAT TGCTGCTGCT ATTGCTTTGA GGGGGATGGC TAAGGATGGA 
AAGTTTGCTG TGAAGAGTGG TGGTGGTGAG AAAGGGAAGG CTGAGGGGGC TATTAAGGGA 
GCTGCTGAGT TGTTGGATAA GCTGGTAAAA GCTGTAAAGA CAGCTGAGGG GGCTTCAAGT 
GGTACTGATG CAATTGGAGA AGTTGTGGCT AATGCTGGTG CTGCAAAGGT TGCTGATAAG 
GCGAGTGTGA CGGGGATTGC TAAGGGGATA AAGGAGATTG TTGAAGCTGC TGGGGGGAGT 
GAAAAGCTGA AAGTTGCTGC TGCTACAGGG GAGAGTAATA AAGGGGCAGG GAAGTTGTTT 
GGGAAGGCTG GTGCTGGTGC TAATGCTGGG GACAGTGAGG CTGCTAGCAA GGCGGCTGGT 
GCTGTTAGTG CTGTTAGTGG GGAGCAGATA TTAAGTGCGA TTGTTAAGGC TGCTGATGCG 
GCTGATCAGG AGGGAAAGAA GCCTGGGGAT GCTANAAATC CGATTGCTGC TGCTATTGGG 
AAGGGTNATG NGGAGAATGG TGCGGAGTTT AANNATGANG GATGA 

t49-2.nt 

TTGTAAAAGCCAAGTTGCTGATAAGGCGAGTGTGACGGGGATTGCTAAGGGAATAAAGGAGATTGTTGAAGCTGCT 

GGGGGGAGTGAAAAGCTGAAAGTTGCTGCTGCTGAAGGGGAGAATAATGAAAAGGCAGGGAAGTTGTTTGG^ 

CTGGTGCTGGTAATGCTGGGGACAGTGAGGCTGCTAGCAAGGCGGCTGGTGCTGTTAGTGCTGTTAGTGGGGAGCA 

GATATTAAGTGCGATTGTTAAGGCTGCTGGTGAGGCTGCGCAGGATGGAGAGAAGCCTGGGGAGGCTAAAAATCCG 

ATTGCTGCTGCTATTGGGAAGGGTAATGAGGATGGTGCGGAGTTTAAGGATGAGATGAAGAAGGATGATCAGATTG 

CTGCTGCTATTGCTTTGAGGGGGATGGCTAAGGATGGAAAGTTTGCTGTGAAGAATGATGAGAAAGGGAAGGCTGA 

GGGGGCTATTAAG 

f49-2.aa 

MFKTIIKQKN MKKISSAILL TTFFVFINCK SQVADKASVT GIAKGIKEIV EAAGGSEKLK 
VAAAEGENNE KAGKLFGKAG AGNAGDSEAA SKAAGAVSAV SGEQILSAIV KAAGEAAQDG 
EKPGEAKNPI AAAIGKGNED GAEFKDEMKK DDQIAAAIAL RGMAKDGKFA VKNDEKGKAE 
GAIKGAGELL DKLVKAVKTA EGASSGTAAI GEWADDNAA KVADKASVKG lAKGIKEIVE 
AAGGSKKLKV AAAKEGNEKA GKLFGKVDAA HAGDSEAASK AAGAVSAVSG EQILSAXVKA 
AGAAAGDQEG KKPGDAKNPI AAAIGKGDAE NGAEFNHDGM KKDDQIAAAI ALRGMAKDGK 
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FAVKSGGGEK GKAEGAIKGA AELLDKLVKA VKTAEGASSG TDAIGEWAN AGAAKVADKA 
SVTGIAKGIK EIVEAAGGSE KLKVAAATGE SNKGAGKLFG KAGAGANAGD SEAASKAAGA 
VSAVSGEQIL SAIVKAADAA DQEGKKPGDA XNPIAAAIGK GXXENGAEFX XXG 

t49-2 .aa 

CKSQVADKASVTGIAKGI KE I VEAAGGSEKLKVAAAEGENNEKAGKLFGKAGAGNAGDSEAASKAAGAVSAVSGEQ 
ILSAIVKAAGEAAQI>3EKPGEAKNPIAAAIGKGNEDGAEFKDEMKKDIDQIAAAIALRGMAKDGKFAVKNDEKGK^ 
GAIK 

f5-14 .nt 

TAGAAATTCA AAACAAAGGA 
GCTGTTTTTG CACTTATAAT 
AATGCAAAAG GGAAAATTAA 
ATTACTTCAA GTAGTTCAAA 
GATAATGAAT TAATGCAGGG 
TTGCCGGAAA ATAGTCATGA 
GGTGGTCAAC AAGAAGACCA 
GAAGTTGTAG AGGAGAAAAA 
AGTCAAAAGC AAAAAGAAGA 
GCAAAAGCTA GAGCAGATAG 
CAACAGGAAG AAGCTAGGGT 
AAACAAGAAG AAGAAAAGAA 
GAAATAAATA AGGATATTGA 
ATAGATAAAA TTACGGGGCC 
AAAACTTGGG GAGATTTAGA 
TTGAGTGATA CTAGACATAA 
GTTCTAGAAA AGGAGCCTAA 
TTAGAAAAAT TAAAATCAGG 
TTTGAAGAAA TTAAAGGATA 

t5-14.nt 

TTGTAAAAATTATGCAACTGGTAAAGATATAAAACAAAATGCAAAAGGGAAAATTAAAGGATTTTTAGATAAGGTT 
TTAGATCCAGCAAAAGATAAAATTACTTCAAGTAGTTCAAAAGTAGATGAATTAGCAAAAAAATTACAAGAAGAAG 
ATGAAGATAATGAATTAATGCAGGGCGATGATCCTAATAACAGAGCAATAGCACTGTTACCAGTATTGCCGGAAAA 
TAGTCATGACAATCCACCAGTACCAAAAGTAAAAGCAGCAGCACAAAGTGGTGGTCAACAAGAAGACCAAAAAGCA 
'AAAGAATCTAAAGATAAAGTTGAGGAAGAAAAAGAAGTTGTAGAGGAGAAAAAAGAAGAACAAGATAGTAAAAAAG 
AAAAAGTGGAGAAGCAAAGTCAAAAGCAAAAAGAAGAAGAGAGAAACTCTAAAGAAGAACAACAAAAACAAGAAG 
AGCAAAAGCTAGAGCAGATAGAGAAAGAGAAGAACGACTAAAACAACAAGAACAAAAAAGACAACAGGAAGAAGCT 
AGGGTTAAAGCAGAAAAAGAAAAACAAGAAAGAGAGGAACAACAAAAACAAGAAGAAGAAAAGAAAGTTAAATATA 
AAATTAAAACACTTACAGACAAT^TAGATGAAATAAATAAGGATATTGATGGTATAAATGGTAAAACAATTGTAGG 
AGCAGAAGAAGTTATAGATAAAATTACGGGGCCTGTATATGATGATTTTACTGATGGGAATAAAGCTATATACA^ 
ACTTGGGGAGATTTAGAGGATGAAGAAGGCGAAGAATTAGGAAAATTATTGAAAGAATTGAGTGATACTAGACATA 
ATTTAAGAACCAAATTAAATGAGGGTAATAAAGCATATATTGTTCTAGAAAAGGAGCCTAATTTAAAAGAAAATGT 
AAATGTTAGTGATATTCAATCAGATTTAGAAAAATTAAAATCAGGATTAGAAGAAGTTAAAAAATATTTTGA^^ 
GAAGATAATTTTGAAGAAATTAAAGGATACATTGAGGATAGTAATTCATAT 

f5-14.aa 

KFKTKEKTKS MNKKILIIFA VFALIISCKN YATGKDIKQN AKGKIKGFLD KVLDPAKDKI 
TSSSSKVDEL AKKLQEEDED NELMQGDDPN NRAIALLPVL PENSHDNPPV PKVKAAAQSG 
GQQEDQKAKE SKDKVEEEKE WEEKKEEQD SKKEKVEKQS QKQKEEERNS KEEQQKQEEA 
KARADREREE RLKQQEQKRQ QEEARVKAEK EKQEREEQQK QEEEKKVKYK IKTLTDKIDE 
INKDIDGING KTIVGAEEVI DKITGPVYDD FTDGNKAIYK TWGDLEDEEG EELGKLLKEL 



GAAAAC/^AAA AGTATGAATA 
TTCTTGTAAA AATTATGCAA 
AGGATTTTTA GATAAGGTTT 
AGTAGATGAA TTAGCAAAAA 
CGATGATCCT AATAACAGAG 
CAATCCACCA GTACCAAAAG 
AAAAGCAAAA GAATCTAAAG 
AGAAGAACAA GATAGTAAAA 
AGAGAGAAAC TCTAAAGAAG 
AGAAAGAGAA GAACGACTAA 
TAAAGCAGAA AAAGAAAAAC 
AGTTAAATAT AAAATTAAAA 
TGGTATAAAT GGTAAAACAA 
TGTATATGAT GATTTTACTG 
GGATGAAGAA GGCGAAGAAT 
TTTAAGAACC AAATTAAATG 
TTTAAAAGAA AATGTAAATG 
ATTAGAAGAA GTTAAAAAAT 
CATTGAGGAT AGTAATTCAT 



AAAAAATATT GATTATTTTT 
CTGGTAAAGA TATAAAACAA 
TAGATCCAGC AAAAGATAAA 
AATTACAAGA AGAAGATGAA 
CAATAGCACT GTTACCAGTA 
TAAAAGCAGC AGCACAAAGT 
ATAAAGTTGA GGAAGAAAAA 
AAGAAAAAGT GGAGAAGCAA 
AACAACAAAA ACAAGAAGAA 
AACAACAAGA ACAAAAAAGA 
AAGAAAGAGA GGAACAACAA 
CACTTACAGA CAAAATAGAT 
TTGTAGGAGC AGAAGAAGTT 
ATGGGAATAA AGCTATATAC 
TAGGAAAATT ATTGAAAGAA 
AGGGTAATAA AGCATATATT 
TTAGTGATAT TCAATCAGAT 
ATTTTGAAAA TGAAGATAAT 
ATTGA 
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SDTRHNLRTK LNEGNKAYIV LEKEPNLKEN VNVSDIQSDL EKLKSGLEEV KKYFENEDNF 
EEIKGYIEDS NSY 

t5-14.aa 

CKl^ATGKDIKQNAKGKIKGFLDK\n^DPAiaDKITSSSSKVDELAKKLQEEDEDNEI^QGD 

SHDNPPVPKVKAAAQSGGQQEDQKAKESKDKVEEEKEWEEKKEEQDSKKEKVEKQSQKQKEEERNSKEEQQKQEE 
AKARADREREERLKQQEQKRQQEEARVKAEKEKQEREEQQKQEEEKKVKYKIKTLTDKIDEIl^DID^ 
AEEVIDKITGPVYDDFTDGNKAIYKTWGDLEDEEGEELGKLLKELSDTRHNLRTKLNEGNKAYIVLEKEPNLKENV 
NVSDIQSDLEKLKSGLEEVKKYFENEDNFEEIKGYIEDSNSY 

£5-15. nt 

TAACTTATGA ATAAGAAAAT GAAAATGTTT ATTATTTGTG 
TCTTGCAAGA ATTATGCAAG TGGTGAAAAT CTAAAAAATT 
TCAGAACAAA ATGTAAAAAA AACAGAACAA GAGATAAAAA 
GAAATTCTAG AGACAAAAGA TTTATCTAAA TTAGATGAAA 
AAACAAATTC AAGAATTAAA GAATAAAATA GAAAAATTAG 
GAAACATATT CTGAGTATGA AGAAAAAATA AACAAAATAA 
GGACTTGAAG ATAAATTTAA GGAGCTTGAA GAGAGTTTAG 
AAAAAAGCTT TACAAGAGGC CAAACAGAAA TTTGAAGAAT 
TCAACTGGGA AAACTCAAGG CGACAGGTCT AAAAACCGAG 
TGGCAGTGTG CCAATGAATT AGGTTTGGGT GTAAGTTATT 
AGCAATACTG ATGAATTAGC AAACAAAGTT ATAGATGATT 
GAACTTAAGG GAATAGAAGA AGATAAAAAA GAATAA 

t5-15.nt 

TTGCAAGAATTATGCAAGTGGTGAAAATCTAAAAAATTCAGAACAAAATCTAGAAAGTTCAGAACAAAATGTAAAA 
AAAACAGAACAAGAGATAAAAAAACAAGTTGAAGGATTTTTAGAAATTCTAGAGACAAAAGATTTATCTAAATTAG 
ATGAAAAAGATACAAAAGAAATTGAAAAACAAATTCAAGAATTAAAGAATAAAATAGAAAAATTAGATTCTAAAAA 
AACTTCTATTGAAACATATTCTGAGTATGAAGAAAAAATAAACAAAATAAAAGAAAAATTGAAAGGAAAAGGACTT 
GAAGATAAATTTAAGGAGCTTGAAGAGAGTTTAGCAAAGAAAAAGGGGGAGAGAAAAAAAGCTTTACAAGAGGCCA 
AACAGAAATTTGAAGAATATAAAAAACAAGTAGATACTTCAACTGGGAAAACTCAAGGCGACAGGTCTAAAAACCG 
AGGTGGTGTTGGAGTGCAAGCTTGGCAGTGTGCCAATGAATTAGGTTTGGGTGTAAGTTATTCTAATGGCGGCAGT 
GACAACAGCAATACTGATGAATTAGCAAACAAAGTTATAGATGATTCTCTTAAAAAGATTGAAGAAGAACTTAAGG 
GAATAGAAGAAGATAAAAAAGAA 

f5-15.aa 

LMNKKMKMFI ICAVFALMIS CKNYASGENL KNSEQNLESS EQNVKKTEQE IKKQVEGFLE 
ILETKDLSKL DEKDTKEIEK QIQELKNKIE KLDSKKTSIE TYSEYEEKIN KIKEKLKGKG 
LEDKFKELEE SLAKKKGERK KALQEAKQKF EEYKKQVDTS TGKTQGDRSK NRGGVGVQAW 
QCANELGIiGV SYSNGGSDNS NTDELANKVI DDSLKKIEEE LKGIEEDKKE 

t5-15 .aa 

CKNYASGENLKNSEQNLESSEQNVKKTEQEIKKQVEGFLEILETKDLSKLDEKDTKEIEKQIQELKNKIEKLDSKK 
TSIETYSEYEEKINKIKEKLKGKGLEDKFKELEESIiAKKKGERKKALQEAKQKFEEYKKQVDTSTGKTQGDRSKNR 
GGVGVQAWQCANELGLGVSYSNGGSDNSNTDELANKVIDDSLKKIEEELKGIEEDKKE 

f51-2.nt 

TAATTGTTTG GGGTTGTGGT AAACTTAAGG CTTATGGAGT GGATTATGAA TAAAAAAATG 

AAAATATTTA TTATTTGTGC TGTATTTGTG CTGATAAGTT CTTGCAAGAT TGATGCAACT 

GGTAAAGATG CAACTGGTAA AGATGCAACT GGTAAAGATG CAACTGGTAA AGATGCAACT 

GGTAAAAATG CAGAACAAAA TATAAAAGGG AAAGTTCAAG GATTTTTAGA AAAGATTTTA 




PCT/US98/12718 



CTGTTTTTGC 
CAGAACAAAA 
AACAAGTTGA 
AAGATACAAA 
ATTCTAAAAA 
AAGAAAAATT 
CAAAGAAAAA 
ATAAAAAACA 
GTGGTGTTGG 
CTAATGGCGG 
CTCTTAAAAA 



ATTGATGATT 
TCTAGAAAGT 
AGGATTTTTA 
AGAAATTGAA 
AACTTCTATT 
GAAAGGAAAA 
GGGGGAGAGA 
AGTAGATACT 
AGTGCAAGCT 
CAGTGACAAC 
GATTGAAGAA 
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GATCCAGTAA AGGATAAAAT TGCTTCAAAT GGTCCAATAG CAGATGAATT GGCAAAAAAA 
TTACAAGAAG AAGAAAAGGT AAATAACGGG GAAGAAGAAA ATGATAAAGC TGTCTTTTTA 
GGAGAAGAAT CAAAAGAGGA TGAAGAAGAA AATGAGCAAG CTGTTAATTT AGAAGAAAAA 
AATGCGGAAG AGGATAAGAA AGTTGTTAAT TTAGAAGAGA AAGAATTAGA AGTTAAAAAA 
GAGACTGAAG AAGATGAAGA TAAAGAAGAA ATAGAGAAAC AAAAACAAGA AGTGGAAAAA 
GCACAAGAAA GAAAACAACG ACAAGAAGAA AAGAAACGAA AAAAACAAGA ACAGCAAGAA 
GAAAAGAAAC GAAAACGACA AGAACAAAGA AAAGAAAGGA GAGCTAAAAA CAAAATTAAA 
AAACTTGCGG ATAAAATAGA TGAGATAAGT TGGAATATTG ATGGTATAGA AAGTCAAACA 
AGTGTAAAAC CGAAAGCAGT TATAGATAAA ATTACGGGGC CTGTATATGA TTATTTTACC 
GATGACAACA AAAAAGCTAT ATATAAAACA TGGGGAGATT TAGAAGATGA AGAAGGCGAA 
GGATTGGGAA AATTATTGAA AGAATTGAGT GATACTAGAG ATGAGTTAAG AACCAAATTA 
AATAAAGATA ATAAAAAATA TTATGCCCAT GAAAATGAGC CTCCTCTAAA AGAAAATGTA 
GATGTCAGCG AAATTAAAGA AGATTTAGAA AAAGTAAAAT CAGGATTAGA AAAGGTTAAA 
GAATATCTTA AAGACAATTC TAAATTTGAA GAAATTAAAG GATACATCAG TTACAGTCAG 
TAA 

t51-2 .nt 



TTGCAAGATTGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCA 
ACTGGTAAAAATGCAGAACAAAATATAAAAGGGAAAGTTCAAGGATTTOTAGAAAAGATTTTAGATCCAGTAAAGG 
ATAAAATTGCTTCAAATGGTCCAATAGCAGATGAATTGGCAAAAAAATTACAAGAAGAAGAAAAGGTAAATAACGG 
GGAAGAAGAAAATGATAAAGCTGTCTTTTTAGGAGAAGAATCAAAAGAGGATGAAGAAGAAAATGAGCAAGCTGTT 
AATTTAGAAGAAAAAAATGCGGAAGAGGATAAGAAAGTTGTTAATTTAGAAGAGAAAGAATTAGAAGTTAAAAAAG 
AGACTGAAGAAGATGAAGATAAAGAAGAAATAGAGAAACAAAAACAAG7VAGTGG7WVAAAGCACAAGAAAGAAAACA 
ACGACAAGAAGAAAAGAAACGAAAAAAACAAGAACAGCAAGAAGAAAAGAAACGAAAACGACAAGAACAAAGAAAA 
GAAAGGAGAGCTAAAAACAAAATTAAAAAACTTGCGGATAAAATAGATGAGATAAGTTGGAATATTGATGGTATAG 
AAAGTCAAACAAGTGTAAAACCGAAAGCAGTTATAGATAAAATTACGGGGCCTGTATATGATTATTTTACCGATGA 
CAACAAAAAAGCTATATATAAAACATGGGGAGATTTAGAAGATGAAGAAGGCGAAGGATTGGGAAAATTATTGAAA 
GAATTGAGTGATACTAGAGATGAGTTAAGAACCAAATTAAATAAAGATAATAAAAAATATTATGCCCATGAAAATG 
AGCCTCCTCTAAAAGAAAATGTAGATGTCAGCGAAATTAAAGAAGATTTAGAAAAAGTAAAATCAGGATTAGAAAA 
GGTTAAAGAATATCTTAAAGACAATTCTAAATTTGAAGAAATTAAAGGATACATCAGTTACAGTCAG 

f51-2.aa 



LFGVWNLRL MEWIMNKKMK IFIICAVFVL 
KNAEQNIKGK VQGFLEKILD PVKDKIASNG 
EESKEDEEEN EQAVNLEEKN AEEDKKWNL 
QERKQRQEEK KRKKQEQQEE KKRKRQEQRK 
VKPKAVIDKI TGPVYDYFTD DNKKAIYKTW 
KDNKKYYAHE NEPPLKENVD VSEIKEDLEK 



ISSCKIDATG KDATGKDATG KDATGKDATG 
PIADELAKKL QEEEKVNNGE EENDKAVFLG 
EEKELEVKKE TEEDEDKEEI EKQKQEVEKA 
ERRAKNKIKK LADKIDEISW NIDGIESQTS 
GDLEDEEGEG LGKLLKELSD TRDELRTKLN 
VKSGLEKVKE YLKDNSKFEE IKGYISYSQ 



t51-2 -aa 



CKIDATGKDATGKDATGKDATGKDATGKNAEQNIKGKVQGFLEKILDP\^DKIASNGPIADEIaAKKLQEEEKVi™ 
EEENDKAVFLGEESKEDEEENEQAVNLEEKNAEEDKKVVNLEEKELEVKKETEEDEDKEEIEKQKQEVEKAQERKQ 
RQEEKKRKKQEQQEEKKRKRQEQRKERRAKNKIKKUADKIDEISWNIDGIESQTSVKPKAVIDKITGPVYDYFTDD 
NKJCAIYKTWGDLEDEEGEGLGKLIiKELSOTRDELRTKLNKDNKKYYAHENEPPLKENVDVSEIKEDLEKVKSGLEK 
VKEYLKDNSKFEEI KG YI S YSQ 



f6-21.nt 



TAGGCAAAAT TTAAATTTAT 
AAGTTAAAAG TTGTATTATT 
AATAGAAACA AATTGGTTTT 
CAATTAATAA ACGATACGGT 
GATGGAGATC CCAGGACTGG 



AAAAACTTGT AAGGATGCTT 
TCTCAATTTA ATTTTACTTA 
TAAGCTAAAT ATTGGAAGTG 
TGGATCAGGG ATTGTAAGCC 
AGGATACAGA CCGGGACTTG 



GTATGAAAAT ATTGATAAAA 
TTTCTTGTGT TAATGAAAGT 
AGCCTGCTAC TTTAGATGCT 
AAATGTTTCT TGGCATTTTA 
CTAAAAGTTG GGATATTTCT 
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GATGACGGAG TAGTTTATAC GTTTCATTTA AGAGATAATC TTGTTTGGAG TGATGGAGTT 
TCCATTACTG CCGAAGAATA A 

t6-21 .nt 

TTGTGTTAATGAAAGTAATAGAAACAAATTGGTTTTTAAGCTAAATATTGGAAGTGAGCCTGCTACTTTAGATGCT 
CAATTAATAAACGATACGGTTGGATCAGGGATTGTAAGCCAAATGTTTCTTGGCATTTTAGATGGAGATCC 
CTGGAGGATACAGACCGGGACTTGCTAAAAGTTGGGATATTTCTGATGACGGAGTAGTTTATACGTTTCATTTAAG 
AGATAATCTTGTTTGGAGTGATGGAGTTTCCATTACTGCCGAAGAA 

f6-21.aa 

AKFKFIKTCK DACMKILIKK LKWLFLNLI LLISCVNESN RNKLVFKLNI GSEPATLDAQ 
LINDTVGSGI VSQMFLGILD GDPRTGGYRP GLAKSWDISD DGWYTFHLR DNLVWSDGVS 
ITAEE 

t6-21 .aa 

CVNESNRNKLVFKLNIGSEPATLDAQLINDTVGSGIVSQMFLGILDGDPRTGGYRPGLAKSWDISDDG 
DNLVWSDGVS ITAEE 

f6-27 .nt 

TAAAGAAAAG CTTGCATAAA AAGTATAACA AATTCTTTAA TAATTAAAAT CAAAAAGAAT 
ATAATTATTG CACTAAAATT AAATTTATAC AGTTATATAG AATCACTTAA GGAACAAAAA 
ATGAAATACC TTAAAAACAT TTCCTTATTT TTGTTAATTT TAGGTTGCAA ATCCATCCCA 
AATGGTAATT TCAATCTACA CGATACAAAC CATAAATTAG GAAAACTAAA ATTTCAAGAA 
GACTCGATAA TAAGCAGAAA TTATGATAAT AAAATATCCA TTGTGGGAGT ATACAACCCT 
TTAACAGAAA AAGAAAATTT TAAAGTCAAT ATTTTCATCA AAAAAAAAGG ATTACAAATA 
GATCCTGAAA ATATTTTGAT AAATGAAGAA AAAATTAATT ATTCAAAATA TAAAGCAGAA 
CTCAAAGTAA AATCTAGCTT TAATAAAAGC ATTATCAGTA TTTCACTAAC TAATTCAAGA 
GATCTATTAA CCTACATTTA CGATAAAAGC ACAGGGAAAT ACATTAACAT TGACTTTAAG 
GACAATTGGA ACGTATCGCA CAGTATAAAA TTTAATAAGG AGTATATTTT AGCATATATA 
ACAGATTTTG ATAAAGAAAT TAAAATATCT AAAAATATTT TGCAAAAACG TATTGATAAT 
AGAAAAATTG AAATTGAAAA AACAGAGCTT AAAACAGAAT ATAATGAAAT AGAGGATTAT 
TACATCTACA GTATGAAAAT TCCAAAATTA TTTGAAAAAT CAGACGCTCC CTCTGAAACT 
TACGAAACAT TTGTTATAGC AAATTATTAC CCCTGTGAAA ATTTAAATAT ACTGTTTTTG 
AATTTAAGCT TATACTCTGA TAAATTACGC TTTCTAAACT CTATTTATGA TGAGAATGAT 
AGAAAATTAA AAATGGAGCC TCCTGTGAGA GCCTTAAAGA ATTCAAAAAC AATAAAAGAA 
ACATTAAATA TAGTATTAAG TCCTCAAAAA ATAATAGAGC TAGCAAAAAA CATTGAAAAA 
GATATTACTC TAAAATTAAA ATCTTACGGA GAAAAGGGAG AATTCACATT TGAAATATAT 
AAACCACTTC TTTTAAAATT CTTAAAAGAA GTAGATCATT GCATAAAAAA TTTGCAATCA 
AGTAGGCATA AATTTTAA 

t6-27 .nt 

TTGCAAATCCATCCCAAATGGTAATTTCAATCTACACGATACAAACCATAAATTAGGAAAACTAAAAT^ 

GACTCGATAATAAGCAGAAATTATGATAATAAAATATCCATTGTGGGAGTATACAACCCTTTAACAGAAAAAGAAA 

ATTTTAAAGTCAATATTTTCATCAAAAAAAAAGGATTACAAATAGATCCTGAAAATATTTTGATAAATGAAG^^ 

AATTAATTATTCAAAATATAAAGCAGAACTCAAAGTAAAATCTAGCTTTAATAAAAGCATTATCAGTATTTCACTA 

ACTAATTCAAGAGATCTATTAACCTACATTTACGATAAAAGCACAGGGAAATACATTAACATTGACTTTAAGGACA 

ATTGGAACGTATCGCACAGTATAAAATTTAATAAGGAGTATATTTTAGCATATATAACAGATTTTGATAAAGAAAT 

TAAAATATCTAAAAATATTTTGCAAAAACGTATTGATAATAGAAAAATTGAAATTGAAAAAAC^^ 

GAATATAATGAAATAGAGGATTATTACATCTACAGTATGAAAATTCCAAAATTATTTGAAAAATCAGACGCTCCCT 

CTGAAACTTACGAAACATTTGTTATAGCAAATTATTACCCCTGTGAAAATTTAAATATACTGTTTTO 

CTTATACTCTGATAAATTACGCTTTCTAAACTCTATTTATGATGAGAATGATAGAAAATTAAAAATGGAGCCTCC 
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GTGAGAGCCTTAAAGAATTCAAAAACAATAAAAGAAACATTAAATATAGTATTAAGTCCTCAAAAAATAATAGAGC 
TAGCAAAAAACATTGAAAAAGATATTACTCTAAAATTAAAATCTTACGGAGAAAAGGGAGAATTCACATTTGAAAT 
ATATAAACCACTTCTTTTAAAATTCTTAAAAGAAGTAGATCATTGCATAAAAAATTTGCAATCAAGTAGGCATAAA 
TTT 

f 6-27.aa 

RKACIKSITN SLIIKIKKNI IIALKLNLYS YIESLKEQKM KYLKNISLFL LILGCKSIPN 
GNFNLHDTNH KLGKLKFQED SIISRNYDNK ISIVGVYNPL TEKENFKVNI FIKKKGLQID 
PENILINEEK INYSKYKAEL KVKSSFNKSI ISISLTNSRD LLTYIYDKST GKYINIDFKD 
NWNVSHSIKF NKEYILAYIT DFDKEIKISK NILQKRIDNR KIEIEKTELK TEYNEIEDYY 
lYSMKIPKLF EKSDAPSETY ETFVIANYYP CENLNILFLN LSLYSDKLRF LNSIYDENDR 
KLKMEPPVRA LKNSKTIKET LNIVLSPQKI lELAKNIEKD ITLKLKSYGE KGEFTFEIYK 
PLLLKFLKEV DHCIKNLQSS RHKF 

t6-27 .aa 

CKSIPNGNFNLHDTNHKLGKLKFQEDSIISRNYDNKISIVGVYNPLTEKENFKVNIFIKKKGLQIDPENILINEEK 

INYSKYKAELKVKSSFNKSIISISLTNSRDLLTYIYDKSTGKYINIDFKDNWNVSHSIKFNKEYILAYITDFDKEI 

KISKNILQKRIDNRKIEIEKTELKTEYNEIEDYYIYSMKIPKLFEKSDAPSETYETFVIAJSTYYPCENLNILFLNLS 

LYSDKLRFLNSIYDENDRKLKMEPPVRALKNSKTIKETLNIVLSPQKIIELAKNIEKDITLKLKSYGEKGEFT 

YKPLLLKFLKEVDHCIKNLQSSRHKF 

f 6-5.nt 

TAAATGAAGA AGTTTTTAAT ATCCGTTTAT TTTTTATTGT TTTATGGTTG TTCAACTATA 
TCTTTGGTAA AAATACCAGA AAAAGATAAA ATAAATTTAA CTGTTTTATC ATCTTTAATG 
AATTATCCTG ATTTGAAGAT TTCAAATTTT AAAATAAAAG ACTACGAACA TTTGCATTAT 
TCATCTGATT TTGAAAGCTT GAGTGATACT AAAAATAGTG CTTATATTTA CGTTGATGAA 
TCTAGTTTCA ATAATAATAT TAATTTTATT AAAGATCTTT TTATTTATAA TAAGAAATTA 
TATAGAATAC TTATTGCTTA TAGCTTGACC CAAGGTGCAT CTTTTAAGGC AGAAGTTTTA 
TCTTATCTTG AAAAACAAAA AATTATGAAA AATTTTTCAT TGAAAATAAA TTTTCCAACT 
GCTAAAAAAT TTATGGATAA TAAGTATTGG ATTGTAATTG CAAAAAACCA TTTAGATTCT 
CTTGTTAAGA GTAAAAATTA TTTAGTCTTG GCGAATGTAA AGATGGAATA TATACTCAAA 
AAGTTTTTAA CTTGA 

t6-5.nt 

TTGTTCAACTATATCTTTGGTAAAAATACCAGAAAAAGATAAAATAAATTTAACTGTTTTATCATCTTTAATGAAT 
TATCCTGATTTGAAGATTTCAAATTTTAAAATAAAAGACTACGAACATTTGCATTATTCATCTGATTTTGAAAGCT 
TGAGTGATACTAAAAATAGTGCTTATATTTACGTTGATGAATCTAGTTTCAATAATAATATTAATTTTATTAAAGA 
TCTTTTTATTTATAATAAGAAATTATATAGAATACTTATTGCTTATAGCTTGACCCAAGGTGCATCTTTTAAGGCA 
GAAGTTTTATCTTATCTTGAAAAACAAAAAATTATGAAAAATTTTTCATTGAAAATAAATTTTCCAACTGCTAA^ 
AATTTATGGATAATAAGTATTGGATTGTAATTGCAAAAAACCATTTAGATTCTCTTGTTAAGAGTAAAAAT 

£6-5 .aa 

MKKFLISVYF LLFYGCSTIS LVKIPEKDKI NLTVLSSLMN YPDLKISNFK IKDYEHLHYS 
SDFESLSDTK NSAYIYVDES SFNNNINFIK DLFIYNKKLY RILIAYSLTQ GASFKAEVLS 
YLEKQKIMKN FSLKINFPTA KKFMDNKYWI VIAKNHLDSL VKSKNYLVLA NVKMEYILKK 
FLT 

1 6 - 5 . aa 

CSTISLVKIPEKI5KINLTVLSSLMSnfPDLKISNFKIKDYEHLHYSSDFESLSiyrKNSAYIY\^ 
LFIYNKKLYRILIAYSLTQGASFKAEVLSYLEKQKIMKNFSLKINFPTAKKFMDNKYWIVIAK^ 
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£7-30. nt 

TAGAGACGAA GTCACAAGCA AAATGTTAAA AGATTTACAA AATCAAGTTC AAGGGGGCAA 
ATAATGAAAA ATTTAAAGAC AAAAATTAAT TTTTTAGGGA TATTTTGGCT ACTGTTACTA 
TTTCTTTCTT GCGAATCAAT ACCATCACTT CCCCAAAAAC CAACCCTAAC AAACAAAGAA 
GATATTGAAA ATTTAATGCT CGATGAAGCA GAACTTTTTA GATACTCAAC CGCACTAAAT 
GTTTGGCTTT TGACTGTAAA ATCTTATGTG ATCAAATACT ATCCTAATGA CAAATTTCCT 
GTGTTTGAAA ATTTTGATCC CGTGTTTGGC GATGAAAATG GAACTAAAGA AACAAATATA 
CTAAAAAATC GAATTACCTA CTACAATCGA TACATAGAAA AAACCGAACC GATTGTATTT 
GGGTGTTACA AAAAATACAG CAGAAGATAA 

t7-30.nt 

TTGCGAATCAATACCATCACTTCCCCAAAAACCAACCCTAACAAACAAAGAAGATATTGAAAATTTAATGCTCGAT 
GAAGCAGAACTTTTTAGATACTCAACCGCACTAAATGTTTGGCTTTTGACTGTAAAATCTTATGTGATCAAATACT 
ATCCTAATGACAAATTTCCTGTGTTTGAAAATTTTGATCCCGTGTTTGGCGATGAAAATGGAACTAAAGAAACAAA 
TATACTAAAAAATCGAATTACCTACTACAATCGATACATAGAAAAAACCGAACCGATTGTATTTGGGTGTTACAAA 
AAATACAGCAGAAGA 

£7-30. aa 

RRSHKQNVKR FTKSSSRGQI MKNLKTKINF LGIFWLLLLF LSCESIPSLP QKPTLTNKED 
lENLMLDEAE LFRYSTALNV WLLTVKSYVI KYYPNDKFPV FENFDPVFGD ENGTKETNIL 
KNRITYYNRY lEKTEPIVFG CYKKYSRR 

t7-30 .aa 

CESIPSLPQKPTLTNKEDIENLMLDEAELFRYSTALNVWLLTVKSYVIKYYPNDKFPVFENFDPVFGDENGTKETN 
ILKNRITYYNRYIEKTEPIVFGCYKKYSRR 

£76-1. nt 

TGAATATTAA TAATAAAAAA AGGAGTAACA ATGAAAATTA TCAACATATT ATTTTGTTTG 
TTTTTACTAA TGCTAAACGG CTGTAATTCT AATGATACAA ATACCAAGCA GACAAAAAGC 
AGACAAAAGC GTGATTTAAC CCAAAAAGAA GCAACACAAG AAAAACCTAA ATCTAAATCT 
AAAGAAGACC TGCTTAGAGA AAAGCTATCT GATGATCAAA AAACACAACT TGACTGGTTA 
AAAACCGCTT TAACTGGTGT TGGAAAATTT GATAAATTCT TAGAAAATGA TGAAGGCAAA 
ATTAAATCAG CACTTGAACA TATAAAGACT GAACTTGATA AATGTAATGG AAATGATGAA 
GGAAAAAACA CCTTCAAAAC TACCGTTCAA GGGTTTTTTA GCGGCGGCAA TATAGATAAT 
TTTGCAGATC AAGCAACTGC TACCTGCAAT TAA 

t76-l.nt 

CTGTAATTCTAATGATACAAATACCAAGCAGACAAAAAGCAGACAAAAGCGTGATTTAACCCAAAAAGAAGCAACA 

CAAGAAAAACCTAAATCTAAATCTAAAGAAGACCTGCTTAGAGAAAAGCTATCTGATGATCAAAAAACACAACTTG 

ACTGGTTAAAAACCGCTTTAACTGGTGTTGGAAAATTTGATAAATTCTTAGAAAATGATGAAGGCAAAATTAA^ 

AGCACTTGAACATATAAAGACTGAACTTGATAAATGTAATGGAAATGATGAAGGAAAAAACACCTTCAA^ 

GTTCAAGGGTTTTTTAGCGGCGGCAATATAGATAATTTTGCAGATCAAGCAACTGCTACCTGCAAT 

£76-1. aa 

ILIIKKGVTM KIINILFCLF LLMLNGCNSN DTNTKQTKSR QKRDLTQKEA TQEKPKSKSK 
EDLLREKLSD DQKTQLDWLK TALTGVGKFD KFLENDEGKI KSALEHIKTE LDKCNGNDEG 
KNTFKTTVQG FFSGGNIDNF ADQATATCN 



t76-l .aa 
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CNSNDTNTKQTKSRQKRDLTQKEATQEKPKSKSKEDLLREKLSDDQKTQLDWLKTALTGVGKFDKFLENDEGKIKS 
ALEHIKTELDKCNGNDEGKNTFKTTVQGFFSGGNIDNFADQATATCN 

fS-lO.nt 

TAAGTAAGGA GAATATTTAT GAAATATAAT ACGATTATAA GCATATTTGT TTGTTTGTTT 
TTAACTGCTT GCAATCCAGA TTTTAACACA AATAAGAAAA GAACTCTAAG TAAGGGGATA 
ATTTCAAATC AAGATGCAGA TTCTGATAAA ATAATAAAAA ATAAATTACT TGATGATTTA 
ATAAATTTAA TAGAAAAAGC GAATGCAGAT AGAGAAAAAT ATGTAAAAAA AATGGAAGAA 
GAACCTTCGG ATCAATATGG AATGTTGGCT GTTTTTGGAG GTATGTATTG GGCAGAATCA 
CCACGGGAAT TAATATCTGA TACAGGTAGT GAGAGATCTA TTAGGTATAG AAGGCGTGTT 
TATAGTATTT TATTAAATGC TATTGAAACT AATGAATTAA AGAAATTTTC AGAAATTAGA 
ATACTGTCAA TAAAAGTACT AGAAATATTT AGCCTATTTA ATCTATTTGG AAGTACTCTT 
GATGATGTGG TTGTTCACTT ATATTCCAAA AAAGATACTC TAGGTAAACT AGATATTTCA 
AATTTAAAAA GACTTAAAAA TTTGTTTGAA AAATTATTAT CTATAAAAAC AATCGTTTCA 
AAGATGTCAA AACGTCTTTT ATTGGATTAT CAAAATAATG AAAATTTTAT AAAAACAGAT 
AACGCCAAGC TTGGATCTTA TGTGGTTGCA CTTTCCAATC AAATTCAAGA AAAATATAAT 
GAAGCAGAAA GGCTGAAAAG CGAGATAATT TTAATATATA CCCTTTAA 

t8-10 .nt 

TTGCAATCCAGATTTTAACACAAATAAGAAAAGAACTCTAAGTAAGGGGATAATTTCAAATCAAGATGCAGATTCT 
GATAAAATAATAAAAAATAAATTACTTGATGATTTAATAAATTTAATAGAAAAAGCGAATGCAGATAGAGAAAAAT 
ATGTAAAAAAAATGGAAGAAGAACCTTCGGATCAATATGGAATGTTGGCTGTTTTTGGAGGTATGTATTGGGCAGA 
ATCACCACGGGAATTAATATCTGATACAGGTAGTGAGAGATCTATTAGGTATAGAAGGCGTGTTTATAGTATTTTA 
TTAAATGCTATTGAAACTAATGAATTAAAGAAATTTTCAGAAATTAGAATACTGTCAATAAAAGTACTAGAAATAT 
TTAGCCTATTTAATCTATTTGGAAGTACTCTTGATGATGTGGTTGTTCACTTATATTCCAAAAAAGATACTCTAGG 
TAAACTAGATATTTCAAATTTAAAAAGACTTAAAAATTTGTTTGAAAAATTATTATCTATAAAAACAATCGTTTCA 
AAGATGTCAAAACGTCTTTTATTGGATTATCAAAATAATGAAAATTTTATAAAAACAGATAACGCCAAGCTTGGAT 
CTTATGTGGTTGCACTTTCCAATCAAATTCAAGAAAAATATAATGT^GCAGAAAGGCTGAAA 

f8-10 .aa 

VRRIFMKYNT IISIFVCLFL TACNPDFNTN KKRTLSKGII SNQDADSDKI IKNKLLDDLI 
NLIEKANADR EKYVKKMEEE PSDQYGMLAV FGGMYWAESP RELISDTGSE RSIRYRRRVY 
SILLNAIETN ELKKFSEIRI LSIKVLEIFS LFNLFGSTLD DVWHLYSKK DTLGKLDISN 
LKRLKNLFEK LLSIKTIVSK MSKRLLLDYQ NNENFIKTDN AKLGSYWAL SNQIQEKYNE 
AERLKSEIIL lYTL 

t8-10 .aa 

CNPDFNTNKKRTLSKGIISNQDADSDKIIKNKLLDDLINLIEKANADREKYVKKMEEEPSDQYGMIJVVFGG^^ 
SPRELISOTGSERSIRYRRRVYSILI^AIETNELKKFSEIRILSIKVLEIFSLFNLFGSTLDDVVVHLYSKKOT 
KLDISNLKRLKNLFEKLLSIKTIVSKMSKRLLLDYQNNENFIKTDNAKLGSYVVALSNQIQEKYNEAERLK 

f8-14.nt 

TAATATATAT TCTTGATTAA GGGAAAGGAG AGTATTTTTA TGAAAAAAAA AATGTTTTTA 
TATACATTGT TAACGATAGG ATTGATGTCT TGTAATCTAA ATTCTAAATT ATCTGGTAAT 
AAAGAGGAAC AAAAAAATAA CAATGATATA AAAGAAGCTT TAAATGGCGT TCAAGAAAAT 
GCTATTAATA ATTTATATGG AAATAAAAAA GAAAAAAAAG ATTTTATTAA AAATTCGGAA 
AAATTGAAAG ACAAGGGTTT AGACGTGACC ACCCTCCCCT TAGAACCTGT AGTGGCGCCC 
TCCGTAGAAT CTGCGGTGTC TTTAGGAGAA TCTAATAATA GGATTGGTAT ACCAACCATT 
TCAATTGAGC ATAATCAAAA AAAAGAGATA AAAGAAGAGG ATTTTTTCCC TTCTACTGAG 
GAAGAAAAGC AAGCGGATAA AGCAATTAAA GATATAGAGA ATCTTATTGG AGAATCTGGA 
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TTTCCCGAGT TAATTGAGAA TGTGTGCTCA CTTAAACATG AATATACTTT AATAAGAAGT 
GATTTTTATG ATGTGATAAC TAAGATTCAG AATAAAAAAA TATCACTAAT GAAAAATTCT 
CATAATAATA GAAATAAAAT AAGGGAACTA GTACAATTGC AAAATAATTT AAAGATAGGA 
GACGAACTTG ATAAAATTAT GGGTTGCATT GATACTGCAG AACAAGAGAT AAGATCTGCC 
GCTTTCTTTT TTGATGAAGC TAAGGAAAGC TTAAAAGAAG GTATTATTAA AAGATTGGAA 
AAAAGTAAAA ATAGGGCAGC ATCACAATTA TCTAAAAAGG CTTTAAATAG AGCAGAGGAT 
GCTTTAAGGT GCTTAGAAAA TTATTCTTCT AAAAAAGGTG AGGCAATAGG AAGAAGAAGC 
TTTATAAAAG AAGTTGTTGA ACAGGCAAAA AATGCTTTAA GTAAGTCTTA A 

t8-14 .nt 

TTGTAATCTAAATTCTAAATTATCTGGTAATAAAGAGGAACAAAAAAATAACAATGATATAAAAGAAGCTTTAAAT 
GGCGTTCAAGAAAATGCTATTAATAATTTATATGGAAATAAAAAAGAAAAAAAAGATTTTATTAAAAATTCGGAAA 
AATTGAAAGACAAGGGTTTAGACGTGACCACCCTCCCCTTAGAACCTGTAGTGGCGCCCTCCGTAGAATCTGCGGT 
GTCTTTAGGAGAATCTAATAATAGGATTGGTATACCAACCATTTCAATTGAGCATAATCAAAAAAAAGAGATAAAA 
GAAGAGGATTTTTTCCCTTCTACTGAGGAAGAAAAGCAAGCGGATAAAGCAATTAAAGATATAGAGAATCTTATTG 
GAGAATCTGGATTTCCCGAGTTAATTGAGAATGTGTGCTCACTTAAACATGAATATACTTTAATAAGAAGTGATTT 
TTATGATGTGATAACTAAGATTCAGAATAAAAAAATATCACTAATGAAAAATTCTCATAATAATAGAAATAAAATA 
AGGGAACTAGTACAATTGCAAAATAATTTAAAGATAGGAGACGAACTTGATAAAATTATGGGTTGCATTGATACTG 
CAGAACAAGAGATAAGATCTGCCGCTTTCTTTTTTGATGAAGCTAAGGAAAGCTTAAAAGAAGGTATTATTAAAAG 
ATTGGAAAAAAGTAAAAATAGGGCAGCATCACAATTATCTAAAAAGGCTTTAAATAGAGCAGAGGATGCTTTAAGG 
TGCTTAGAAAATTATTCTTCTAAAAAAGGTGAGGCAATAGGAAGAAGAAGCTTTATAAAAGAAGTTGTTGAACAGG 
CAAAAAATGCTTTAAGTAAGTCT 

f8-14.aa 

YIFLIKGKES IFMKKKMFLY TLLTIGLMSC NLNSKLSGNK EEQKNNNDIK EALNGVQENA 

INNLYGNKKE KKDFIKNSEK LKDKGLDVTT LPLEPWAPS VESAVSLGES NNRIGIPTIS 

lEHNQKKEIK EEDFFPSTEE EKQADKAIKD lENLIGESGF PELIENVCSL KHEYTLIRSD 

FYDVITKIQN KKISLMKNSH NNRNKIRELV QLQNNLKIGD ELDKIMGCID TAEQEIRSAA 

FFFDEAKESL KEGIIKRLEK SKNRAASQLS KKALNRAEDA LRCLENYSSK KGEAIGRRSF 
IKEWEQAKN ALSKS 

t8-14.aa 

CNLNSKLSGNKEEQKNNNDIKEALNGVQENAIKnSTLYGNKKEKKDFIKNSEKLKDKGLDVTTLPLEPW 
SLGESNNRIGIPTISIEHNQKKEIKEEDFFPSTEEEKQADKAIKDIENLIGESGFPELIENVCSLKHEYTLIRSDF 
YDVITKIQNKKISLMKNSHNNRNKIRELVQLQNNLKIGDELDKIMGCIDTAEQEIRSAAFFFDEAKESLKEGIIKR 
LEKSKNRAASQLSKKALNRAEDALRCLENYSSKKGEAIGRRSFIKEWEQAKNALSKS 

fOlA.nt BBOOl 

TGATTAATTTTTTTTAAGGATTACGTTTTGAAAAGAAACAAAATTTGGAAAACGTTAAAACTGTTTCAAATA^ 

TACTGTTCTCATGCTCTTTTTATTCTAAATCAAACAACACAGAAGCGATAAGTGAATTACAATCAAGCCCTATT^ 

ACTTGGAAAAATTAAAGTTTTACAAAAAACAGAAAAGATTGTAAGCACCCAAAATCTTCAAAACTTACAACAAAGC 

CAGTTCTTTAAAAATGAAAAAGAAA?\AATAATTAAAAAAATTGCACAAGAATTTGATGAGAATGAAAAATTGATT^ 

ATAAAATAGGTCCAAATATCGAAATGTTTGCTCAAACAATAAACACGGATATTCAAAAAATCGAACCTAATGATCA 

ATTTGGAATAAATAAAACTTTATTCACAGAAAAAAAAGACAATAATATTGACTTTATGTTAAAAGACAATCGACTT 

AGAAGATTATTTTACTCATCTTTAAATTATGATGAAAATAAAATCAAAAAATTAGCCACAATACTCGCGCA^ 

CAAGCTCAAACGACTACCATTACACACTTATTGGTTTAATTTTTTGGACAGGATTTAAAATCCAAGAAGCATTTGA 

AAGCGCTGTTAATATTTTAACTAAAGACGAGCAAAAGCGCCTAATTTTTAATTTTAGAACAAAAACAGTAAAAG^^ 

ATTCAGGAAAATTTTGAAAAACTAATGCAAGAGAGAAATTCATGGATAAAAATCGTCGATAACATO 

ATGACAAAAATACGGGAGGATGCAAAGCTGATGGAAAAATTCTCGGAGAAGTAATAAGGGTTGGATA 

ACTCGACTCAAATAAAAGTATGCAAATTTTAAACAATATTGAAACACCGCTAAAAACCTGTTGTGACCACATACAC 

TACTAA 
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tOlA.nt BBOOl 

TGCTCTTTTTATTCTAAATCAAACAACACAGAAGCGATAAGTGAATTACAATCAAGCCCTATTAAACTTGGAAAAA 

TTAAAGTTTTACAAAAAACAGAAAAGATTGTAAGCACCCAAAATCTTCAAAACTTACAACAAAGCCAGTTCTTTAA 

AAATGAAAAAGAAAAAATAATTAAAAAAATTGCACAAGAATTTGATGAGAATGAAAAATTGATTAATAAAATA^ 

CCAAATATCGAAATGTTTGCTCAAACAATAAACACGGATATTCAAAAAATCGAACCTAATGATCAATTTGG 

ATAAAACTTTATTCACAGAAAAAAAAGACAATAATATTGACTTTATGTTAAAAGACAATCGACTTAGAAGATTATT 

TTACTCATCTTTAAATTATGATGAAAATAAAATCAAAAAATTAGCCACAATACTCGCGCAAACATCAAGCTCAAAC 

GACTACCATTACACACTTATTGGTTTAATTTTTTGGACAGGATTTAAAATCCAAGAAGCATTTGAAAGCGCTGTTA 

ATATTTTAACTAAAGACGAGCAAAAGCGCCTAATTTTTAATTTTAGAACAAAAACAGTAAAAGAGATTCAGGAAAA 

TTTTGAAAAACTAATGCAAGAGAGAAATTCATGGATAAAAATCGTCGATAACATTATTGGCGAATATGACAAAAAT 

ACGGGAGGATGCAAAGCTGATGGAAAAATTCTCGGAGAAGTAATAAGGGTTGGATACGAGCATGAACTCGACTCAA 

ATAAAAGTATGCAAATTTTAAACAATATTGAAACACCGCTAAAAACCTGTTGTGACCACATACACTAC 

fOlA.aa BBOOl 

LIFFKDYVLKRNKIWKTLKLFQITLLFSCSFYSKSNNTEAISELQSSPIKLGKIKVLQKTEKIVSTQNLQNLQQSQ 
FFKNEKEKIIKKIAQEFDENEKLINKIGPNIEMFAQTINTDIQKIEPNDQFGINKTLFTEKKDIONIDFMLKDNRLR 
RIJFYSSLNYDENKIKKIiATILAQTSSS^^^YHYTIiIGLIFWTGFKIQEAFESAVNILTKDEQKRLIFNFRTKTVKEI 
QENFEKLMQERNSWIKIVDNIIGEYDKNTGGCKADGKILGEVIRVGYEHELDSNKSMQILNNIETPLKTCCDHIHY 

tOlA.aa BBOOl 

CSFYSKSNNTEAISELQSSPIKLGKIKVLQKTEKIVSTQNLQNLQQSQFFKNEKEKIIKKIAQEFDENEKLINKIG 
PNIEMFAQTINTDIQKIEPNDQFGINKTLFTEKKDNNIDFMLKDNRLRRLFYSSIiNYDENKIKK^ 
DYHYTLIGLIFWTGFKIQEAFESAVNILTKDEQKRLIFNFRTKTVKEIQENFEKLMQERNSWIKIVDNIIGEYDKN 
TGGCKADGKILGEVIRVGYEHELDSNKSMQILNNIETPLKTCCDHIHY 

f02A.nt BB002 

TAATTAATACTGGTTTTAATTTATAAGGAGAGTATTTTGAAAAAAGCCAAACTAAATATAATCAAGATTAATATTA 
TTACAATGATATTAACTTTAATTTGCATCTCATGTGCACCTTTTAACAAAATCAATCCCAAGGCAAATGAAAACAC 
CAAGCTTAAAAAAAACACCAGACTGAAAAAACCCGCCAATCCAGGGGAAAACATCCAAAATTTTAAAGATAAATCT 
GGAGACCTTGGCGCTTCTGATGAAAAATTTATGGGAACTACCGCTTCAGAGCTAAAAGCAATTGGTAAGGAGCTAG 
AAGATCGAAAAAATCAATACGATATACAAATAGCCAAAATTACTAATGAAGAATCTAACCTATTAGATACTTATAT 
TCGGGCTTATGAACTAGCTAACGAAAATGAAAAAATGCTTTTAAAAAGATTTCTTCTTTCATCTTTAGATTATAAA 
AAAGAAAACATAGAGACATTAAAAGAAATTCTTGAAAAACTCATAAATAATTACGAAAACGACCCCAAAATTGCTG 
CAAATTTCCTTTATCGCATAGCGCTGGATATTCAATTAAAACTGGAAAAGCACTTAAAATCAATAAATGAAAAACT 
GGACACTCTAAGCAAAGAAAATTCAAAAGAAGATTTAGAGGCGTTGCTAGAACAAGTAAAATCTGCCTTACAGCTA 
CAAGAAAAGTTTAAAAAAACCCTAAACAAAACTCTTGAAGATTACCGTAAAAATACTAACAACATTCAAGAAAATA 
AAGTACTAGCAGAACACTTTAATAAATATTACAAAGACTCTGATTCTTTACAATCTGCCTTTTATTAA 

t02A.nt BB002 

TGTGCACCTTTTAACAAAATCAATCCCAAGGCAAATGAAAACACCAAGCTTAAAAAAAACACCAGACTGAAAAAA 

CCGCCAATCCAGGGGAAAACATCCAAAATTTTAAAGATAAATCTGGAGACCTTGGCGCTTCTGATGAAAAATTTAT 

GGGAACTACCGCTTCAGAGCTAAAAGCAATTGGTAAGGAGCTAGAAGATCGAAAAAATCAATACGATATACAAATA 

GCCAAAATTACTAATGAAGAATCTAACCTATTAGATACTTATATTCGGGCTTATGAACTAGCTAACGAAAATGAAA 

AAATGCTTTTAAAAAGATTTCTTCTTTCATCTTTAGATTATAAAAAAGAAAACATAGAGACATTAAAAGAAATTCT 

TGAAAAACTCATAAATAATTACGAAAACGACCCCAAAATTGCTGCAAATTTCCTTTATCGCATAGCGCTGGATATT 

CAATTAAAACTGGAAAAGCACTTAAAATCAATAAATGAAAAACTGGACACTCTAAGCAAAGAAAATTCA^^ 

ATTTAGAGGCGTTGCTAGAACAAGTAAAATCTGCCTTACAGCTACAAGAAAAGTTTAAAAAAACCCTAAACAAAAC 

TCTTGAAGATTACCGTAAAAATACTAACAACATTCAAGAAAATAAAGTACTAGCAGAACACTTTAATAAAT 

AAAGACTCTGATTCTTTACAATCTGCCTTTTAT 
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LILVLIYKESILKKAKIJSrilKINIITMILTLICISCAPFNKINPKANENTKLKKNTRLKJCPANPGENIQNF^ 
DLGASDEKFMGTTASELKAIGKELEDRKNQYDIQIAKITNEESNLLOTYIRAYELANENEKM^ 
ENIETLKEILEKLINl^ENDPKIAANFLYRIALDIQLKLEKHLKSINEKLIXrLSKENSKEDLEALLEQVKSAL 
EKFKKTLNKTLEDYRKNTNNIQENKVLAEHFNKYYKDSDSLQSAFY 

t02A.aa BB002 

CAPFNKINPKANENTKLKKNTRLKKPANPGENIQNFKDKSGDIiGASDEKFMGTTASELKAIGKELEDRKNQYDIQI 
AKITNEESNLLDTYIRAYELANENEKMLLKRFLLSSLDYKKENIETLKEILEKLIN^nfE^roPKIAANFL^ 
QLKLEKHLKSINEKLDTLSKENSKEDLEALLEQVKSALQLQEKFKKTLNKTLEDYRKNTISnsriQENKVLAEHFNKYY 
KDSDSLQSAFY 

f03A.nt BB006 

TGATTTAATGTAAATTTTAATTACCGCCTAAAAAAGGCTTTAAATGGTATAAAGGAAGAAGATCTAATGGTATTTA 
GAACATATAAACATTTGGAACTAATAATGCTGCCCATGTTAATGCTGAGTTGCGCTTTTTTTAAGAAACCACAATC 
TGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTACATTTAATATCAGGCAAAATTTCAAAT 
AAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAACAAAGGCAATGACAATCTTAGGCGAAG 
ATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATAATATCTCCTGTAAAAATGGATGGAAA 
ATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATGGAGATGATGAATATGAAATTGAAGAT 
GTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTTAGCTGTTGAAAATTCACAAGAAGAAG 
GATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAAAATGCTTTTAAATTAACATATAAAAA 
TGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTACTCAAGAAACTAAAATTTATAAAATT 
TCTCTTAATTCAAJ\ATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGAAAATTCTATATTAAAAGACATAGCTG 
GAGATTTATTTGAAGATATATAA 

t03A.nt BB006 

TGCGCTTTTTTTAAGAAACCACAATCTGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTAC 

ATTTAATATCAGGCAAAATTTCAAATAAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAAC 

AAAGGCAATGACAATCTTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATA 

ATATCTCCTGTAAAAATGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATG 

GAGATGATGAATATGAAATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTT 

AGCTGTTGAAAATTCACAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAA 

AATGCTTTTAAATTAACATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTT 

CTCAAGAAACTAAAATTTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGA 

AAATTCTATATTAAAAGACATAGCTGGAGATTTATTTGAAGATATA 

f03A.aa BB006 

FNVNFNYRLKKAIJSrGIKEEDLMVFRTYKHLELIl^PMLMLSCAFFKJCPQSVHQDSNTC 

KLPIINSNHDVTWIKTKAMTILGEDGKEIPEFKNKFGYSYIISPVKMDGKYSYYASLLILFETTKNGDDEYEIEDV 

KFVTAGSTLELKNSLI^VENSQEEGYWAYPFGIIiMSDEIKNAFKLTYKNGHWNYML^ 

liNSKLIIEFLKEVLKENSILKDIAGDIiFEDI 

t03A.aa BB006 

CAFFKKPQSVHQDSNTGKPISDEKLHLISGKISNKKLPIINSNHDVTWIKTKAMTILGEDGKEIPEFKNKFGYSYI 

ISPVKMIXSKYSYYASLLILFETTKNGDDEYEIEDVKFWAGSTLELKNSLIAVENSQEEGY^ 

NAFKLTYKNGHWNYMLADLTVKNKLTQETKIYKISLNSKLIIEFLKEVLKENSILKDIAGDLFEDI 

f04A.nt BBOll 

TAATTACCAAAGATAAGTAAACTTGCAAATAAAACTACACGTATTGAAAGTAGATTTGAAATTTCCATTATATTTA 

TATATAATGGCACTAAATATCTGAAAATGAAGGAGAAGCGGGTGGGCAATAAAATTTTTTA 

AATTTTAATAGTTGGTTGCGACTGGGGAACTATTAAAGATAAAAGTACAGT^AATTTC^ 
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AAAGATAAGACTAAAAATCAAGATAGAATAGAATTGGGTGAAGATAATTTTGTATCTAAAAATAATATGTCTACTA 
CTGATACGGGCATTACTAGTTTAGGAAGTCTAAACAACTTGGATTTAATTAATCGTTCACAGCGGGTCAGTGAACC 
ACCTATAATCTCAAATGAGAAAGCCATAGCTACTCAAGCAAAAGTAGATTTAATGAACAACATTAATGTTACTATA 
ATAAACCCAAAACCAGCTCAAAATTTGGGAAATTCTTTT^AACAATACTACTACTGAAGATAGTGTGAAGTTTTTAT 
CAATTGAAAACCAAGAGTGGCTTATTAGTAAAAAGATTTTGCCCAGTAAGTTGGAAAATTTAGAAAGCTTTCTAAA 
AACACAACACGAAAAAGAAGCTTTTAAGACGGCTAAAACTATACAAAGTCTCATTAGTAATTCCAATATGGGTAAA 
GAAATTATTAAGTTTAAGGAAGAATATTACAAACTTTATAATTTGTTTGAAGGCATACAACAAAAATTCCATAGTC 
AAAGGAATTCATTTATAAAAGATACTAAATTTGGGGAAAATAGACAAAAAAATGCAGTTATATTTAAATCCTTTTC 
ATCTATAGAGAAAGAAATTAGAGATTTGAATTATAAGTTGNGTGAAATCCAAAGTAATTTTCAAATTGCAGATGT^ 
AGCTGGAATAATGCAAACTCTCTTTTAAAAGAATCTATAGAAAAATTAATTCAGGCAATTGAAAAAAGGTATGACA 
ATGAGAGTAGAAAGCAAGGTCAAATTGGTGGACCTGCTAATAGATGGGATAAAAATCAAGCTGACAATTTTGCTAA 
GGATGCAAAGTATAAGGCAGAACATTCAGCAAATGATTTGGAAAATGCAGCCAACTATTTTAGATATAGTTGTTCA 
AATGAAAAAGAAGCTAAAAAGCTATTAGAAGAAATTAAAAAAAGATTTGTACGAATTGGTATTAGCCTATAA 

t04A.nt BBOll 

TGCGACTGGGGAACTATTAAAGATAAAAGTACAGAAATTTCCAAGCTATTAAGAACGGACAAAGATAAGACTAAAA 
ATCAAGATAGAATAGAATTGGGTGAAGATAATTTTGTATCTAAAAATAATATGTCTACTACTGATACGGGCATTAC 
TAGTTTAGGAAGTCTAAACAACTTGGATTTAATTAATCGTTCACAGCGGGTCAGTGAACCACCTATAATCTCAAAT 
GAGAAAGCCATAGCTACTCAAGCAAAAGTAGATTTAATGAACAACATTAATGTTACTATAATAAACCCAAAACCAG 
CTCAAAATTTGGGAAATTCTTTAAACAATACTACTACTGAAGATAGTGTGAAGTTTTTATCAATTGAAAACCAAGA 
GTGGCTTATTAGTAAAAAGATTTTGCCCAGTAAGTTGGAAAATTTAGAAAGCTTTCTAAAAACACAACACGAAAAA 
GAAGCTTTTAAGACGGCTAAAACTATACAAAGTCTCATTAGTAATTCCAATATGGGTAAAGAAATTATTAAGTTTA 
AGGAAGAATATTACAAACTTTATAATTTGTTTGAAGGCATACAACAAAAATTCCATAGTCAAAGGAATTCATTTAT 
AAAAGATACTAAATTTGGGGAAAATAGACAAAAAAATGCAGTTATATTTAAATCCTTTTCATCTATAGAGAAAGAA 
ATTAGAGATTTGAATTATAAGTTGNGTGAAATCCAAAGTAATTTTCAAATTGCAGATGTTAGCTGGAATAATGCAA 
ACTCTCTTTTAAAAGAATCTATAGAAAAATTAATTCAGGCAATTGAAAAAAGGTATGACAATGAGAGTAGAAAGCA 
AGGTCAAATTGGTGGACCTGCTAATAGATGGGATAAAAATCAAGCTGACAATTTTGCTAAGGATGCAAAGTATAAG 
GCAGAACATTCAGCAAATGATTTGGAAAATGCAGCCAACTATTTTAGATATAGTTGTTCAAATGAAAAAGAAGCTA 
AAAAGCTATTAGAAGAAATTAAAAAAAGATTTGTACGAATTGGTATTAGCCTA 

f04A.aa BBOll 

LPKISKLANKTTRIESRFEISIIFIYNGTKYLKMKEKRVGNKIFYISWLILIVGCDWGTIKDKSTEISKLLRTDK 

DKTKNQDRIELGEDNFVSKNI^STTDTGITSLGSLNNLDLimSQRVSEPPIISNEKAIATQAKVDLMm 

NPKPAQNLGNSLNNTTTEDSVKFLSIENQEWLISKKILPSKLENLESFLKTQHEKEAFKTAKTIQSLISNSNMGKE 

IIKFKEEYYKLYNLFEGIQQKFHSQRNSFIKDTKFGENRQKNAVIFKSFSSIEKEIRDLNYKLXEIQSNFQIADVS 

WNNANSLLKESIEKLIQAIEKIlYDNESRKQGQIGGPANRWDKNQApNFAKDAKYKAEHSANDIiENA;^ 

EKEAKKIiLEEIKKRFVRIGISL 

t04A.aa BBOll 

CDWGTIKDKSTEISKLLRTDKDKTKNQDRIELGEDNFVSKNNMSTTDTGITSI^SLl^ 
EKAIATQAKVDUyiNNINVTIINPKPAQNLGNSIJSnsrrT^ 

EAFKTAKTIQSLISNSNMGKEIIKFKEEYYKLYNLFEGIQQKFHSQRNSFIKDTKFGENRQKNAVIFKSFSSIEKE 

IRDLNYKLXEIQSNFQIADVSWl^ANSLLKESIEKLIQAIEKRYDNESRKQGQIGGPANRWDKNQADNFAKDA^ 

AEHSANDLENAANYFRYSCSNEKEAKKLLEEIKKRFVRIGISL 

fOSA.nt BB009 

TAAATAAATTGTAGGATAAAAATGAAACAAAAATACGAAAACTATTTTAAAAAAAGATTAATTTTAAACCT^ 

TATTTTTACTACTAGCATGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAAT^ 

CAATATTTTGGGCAGTTCAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCCATGCATTT^ 

TTTAAAAAAGAAAACGGCAAGATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTT^ 

TATCTGGAAAAACCTATCTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAAT^ 

GACATTAAAATTTAAAAAACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGAAGAGATGGCGTAA 
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tOBA.nt BB009 

TGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAATACAATATTTTGGGCAGTT 
CAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCCATGCATTTATTTAAAAAAGAAAACGG 
CAAGATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTTATAAACGACATTGTAAATATATCTGGAAAAACCTAT 
CTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATTGGACATTAAAATTTAAAA 
AACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGAAGAGATGGCG 

fOSA.aa BB009 

INCRIKMKQKYENYFKKRLIIJSrLLIFLLLACSSESXFSQI^NLQKIKHEYNII^SSSPRGISLVGETLYIAA^ 
KKENGKIEKIDLSNSYEFINDIVNISGKTYLLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFLKSVEEMA 

tOSA.aa BB009 

CSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLFKKENGKIEKIDLSNSYEFINDIVNISGKTY 
LLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFLKSVEEMA 

f06A,nt BB014 

TAAGGAGCATATATGAGGATTTTGGTTGGCGTTTGTATAATAGCATTGGCTTTATTGGGTTGTTATTTGCCTGATA 
ATCAGGAACAAGCTGTTCAAACTTTTTTTGAGAATTCGGAAAGTAGTGATATGGGTTCCGATGAGATTGTTACTGA 
AGGCATATTTTCTAGTTTAAAATTATATGCGTCTGAACATCGTTTATTGGTTGAGATAAAAAAGACTTTAATTAGT 
TTAAAAGATCCTAATTATCNNGNTGTAGTACNCCCAGTGAGTGACTATAATGAGGAGTATTTTAATAAATTCTTTC 
TAGATTTAGGGTCTGAGCAATCTAAAGACCTGATTAAGTTGTTTATTATGGTAAAAAATGAGCAGAACAATAATAA 
ATTTATGCGTATAGTTCGTTGGCTGTATTCATGTATAGAGGAGTTATATTCTCTAGATATTAAGTATTCTGGCGAG 
GGGAGCCATGAGTATAATCGTAATATGCCTAGACCCACTGCTTATGAACAATATTTAAAAGTGAAGAGGTATGATT 
ATAATAGCCCAGTTTCTATTTTACCTACATAA 

t06A.nt BB014 

tgttatttgcctgataatcaggaacaagctgttcaaactttttttgagaattcggaaagtagtgatatgggttccg 

atgagattgttactgaaggcatattttctagtttaaaattatatgcgtctgaacatcgtttattggttgagataaa 

aaagactttaattagtttaaaagatcctaattatclsnsrgntgtagtacncccagtgagtgactataatgagg 

tttaataaattctttctagatttagggtctgagcaatctaaagacctgattaagttgtttattatggtaaa;^ 

agcagaacaataataaatttatgcgtatagttcgttggctgtattcatgtatagaggagttatattctctagatat 

taagtattctggcgaggggagccatgagtataatcgtaatatgcctagacccactgcttatgaacaatatttaaaa 

gtgaagaggtatgattataat 

fOSA.aa BB014 

GAYMRILVGVCIIAIJU^LGCYLPDNQEQAVQTFFENSESSDMGSDEIVTEGIFSSLKLYASEHRLLVEIKKTLIS^ 
KDP^^raCVVXPVSDYl^EYFNKFFLDI^SEQSKIDLIKLFIMVKNEQ^^^KFM^ 
SHEYNRNMPRPTAYEQYLKVKRYDYNS PVS I LPT 

t06A.aa BB014 

CYLPDNQEQAVQTFFENSESSDMGSDEIVTEGIFSSLKLYASEHRLLVEIKKTLISLKDPNYXXVVXPVSDYNEEY 

FNKFFLDLGSEQSKDLIKLFIMVKNEQNNNKFmiWWLYSCIEELYSLDIKYSGEGSHEYK^^ 

VKRYDYN 

f07A.nt BB023 

TAAAGTATTTTATTTTTTTTATTATCCACTGTTCTTTTTGCTCAAGAGACT^ 
GGGCAGAGCCTGGAGAATTAGTTTTAGATTTTGCCGAGCTTGCAAGAGATCCAAGOT 
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AAATTATGTTGATTATGTATATTCGGGCGCTTCTGGTATTGTTAAGCCGGAAGATATGGT^ 

AATAATTGGAGCGTTTTACTTACTCCTTCTGCAAGGTTGCAGGCTTACGTTAAAAATTCAGTTGTTGCGCCCGCTC 
TTGTTAAGAGTGAGTCAAAAAGGTACGCAGGTGATACTATTTTAGGGGTAAGAGTTTTGTTTCCAAGCTATTCTCA 
ATCATCTGCTATGATTATGCCACCATTTAAAATTCCTTTTTATTCAGGGGAAAGTGGCAATCAATTTTTAGGCAAA 
GGTCTTATTGATAACATTAAAACCATGAAAGAAATTAAGGTATCTGTTTATAGTTTAGGGTATGAGATAGATCTTG 
AGGTTTTATTTGAAGATATGAATGNCATGGAATATGCOTNNTCTATGGGTACTTT 

tttaatttggtcaaatcctaactatattcctaatatatcatccagaattattaaagacgatgttccaaattatcct 
cttgcttcaagtaaaatgagatttaaggcttttagagtttcaaagtcacacagttca;^^ 
tttatgttaaagatttaagagttctttatgataagttgagtgtttcaatagattctgatattgacagtgagtctgt 
atttaaagtttatgagactagcggaactgaatcccttcgtaaattaaaggcacacgnaacntttaaaagngtt^ 

AAGCTTAGAGAAAAAATTTCTATGCCTGAAGGCTCTTTCCAAAACTTTGTAGAAAAGATTGAGAGTGAAAAACCTG 
AAGAATCATCTCCGAAAAATTAG 

tOVA.nt BB023 

GAGGGTTCTAAAAGGGCAGAGCCTGGAGAATTAGTTTTAGATTTTGCCGAGCTTGCAAGAGATCCAAGTTCAACTA 

GACTTGATCTTACAAATTATGTTGATTATGTATATTCGGGCGCTTCTGGTATTGTTAAGCCGGAAGATATGGTTGT 

AGATCTTGGGATAAATAATTGGAGCGTTTTACTTACTCCTTCTGCAAGGTTGCAGGCTTACGTTAAAAATTCA 

GTTGCGCCCGCTGTTGTTAAGAGTGAGTCAAAAAGGTACGCAGGTGATACTATTTTAGGGGTAAGAGTTTTGTTTC 

CAAGCTATTCTCAATCATCTGCTATGATTATGCCACCATTTAAAATTCCTTTTTATTCAGGGGAAAGTGGCAATCA 

ATTTTTAGGCAAAGGTCTTATTGATAACATTAAAACCATGAAAGAAATTAAGGTATCTGTTTATAGTTTAGGGTAT 

GAGATAGATCTTGAGGTTTTATTTGAAGATATGAATGNCATGGAATATGCTTNNTCTATGGGTACTTTAAAGTTTA 

AAGGGTGGGCTGATTTAATTTGGTCAAATCCTAACTATATTCCTAATATATCATCCAGAATTATTAAAGACGATGT 

TCCAAATTATCCTCTTGCTTCAAGTAAAATGAGATTTAAGGCTTTTAGAGTTTCAAAGTCACACAGTTCAAAAGAG 

CAAAATTTCATCTTTTATGTTAAAGATTTAAGAGTTCTTTATGATAAGTTGAGTGTTTCAATAGATTCTGATATTG 

ACAGTGAGTCTGTATTTAAAGTTTATGAGACTAGCGGAACTGAATCCCTTCGTAAATTAAAGGCACACGNAACNTT 

TAAAAGNGTTTTAAAGCTTAGAGAAAAAATTTCTATGCCTGAAGGCTCTTTCCAAAACTTTGTAGAAAAGATTGAG 

AGTGAAAAACCTGAAGAATCATCTCCGAAAAAT 

f07A.aa BB023 

SILFFLLSTVLFAQETIX3LAEGSKRAEPGELVLDFAELARDPSSTRLDLTKnfVDyVYSGASGI\^ 

NWSVLLTPSARLQAYVKNSWAPAVVKSESKRYAGDTIIX3VRVLFPSYSQSSAMIMPPFKIPFYSGESGNQFLGKG 

LIDNIKTMKEIKVSWSLGYEIDLEVLFEDMNXMEYAXSMGTLKFKGWADLIWSNPNYIPNISSRIIKDDVPlSr^ 

ASSKMRFKAFRVSKSHSSKEQNFIFYVKDLRVLYDKLSVSIDSDIDSESVFKVYETSGTESLRKLKAHXTFKXVLK 

LREKISMPEGSFQNFVEKIESEKPEESSPKN 

t07A.aa BB023 

EGSKRAEPGELVLDFAELARDPSSTRIiDLTNYVDYVYSGASGIVKPEDMVVDLiGINl^ 

VAPAVVKSESKRYAGDTILGVRVLFPSYSQSSAMIMPPFKIPFYSGESGNQFIXSKGLIDNIKTMKEIKVSW 
EIDLE^^^FEDM^DCMEYAXSMGTLKFKGWADLIWSNPNYIPNISSRIIKDDVPNYPL^ 

QNFIFYVKDLRVLYDKLSVSIDSDIDSESVFKVYETSGTESLRKLjKAHXTFKXVLKLREKISMPEGSFQNFVEKIE 
SEKPEESSPKN 

fOSA.nt BB024 

TGAATATTAATAATAAAAAAAGGAGTAACAATGAAAATCATCAACATATTATTTTGTTTATTTTTACTAATGCTAA 

ACGGCTGTAATTCTAATGATAATGACACTTTAAAAAACAATGCCCAACAAACAAAAAGACGGGGAAAGCGTGATTO 

AACCCAAAAAGAAACAACACAAGAAAAACCAAAATCTAAAGAAGAACTACTTAGAGAAAAGCTATCTGACGATCAA 

AAAACACATCTTGACTGGTTAAAACCCGCTTTAACTGGTGCTGGAGAATTTGACAAATTCTTAGAAAA^ 

ATAAAATAAAATCAGCACTTGATCATATAAAAACTCAACTTGATAGTTGTAATGGTGATCAAGCAG 

AACCACTTTCAAAACTGTGGTTACAGAATTCTTTAAAAATGGTGATATAGATAATTT^ 

AACTGCAATAATGGTGGCTAA 



tOSA.nt BB024 
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TGTAATTCTAATGATAATGACACTTTAAAAAACAATGCCCAACAAACAAAAAGACGGGGAAAGCGTGATTTA^ 

AAAAAGAAACAACACAAGAAAAACCAAAATCTAAAGAAGAACTACTTAGAGAAAAGCTATCTGACGATCAAAAAAC 

ACATCTTGACTGGTTAAAACCCGCTTTAACTGGTGCTGGAGAATTTGACAAATTCTTAGAAAATGATGATGATAAA 

ATAAAATCAGCACTTGATCATATAAAAACTCAACTTGATAGTTGTAATGGTGATCAAGCAGAACAACAAAAAACCA 

CTTTCAAAACTGTGGTTACAGAATTCTTTAAAAATGGTGATATAGATAATTTTGCAACTGGAGCGGTTAGTAACTG 

CAATAATGGTGGC 

£08A.aa BB024 

ILIIKKGVTMKIINILFCLFLLMLNGCNSNDNDTLKimAQQTKRRGKJUDLTQKET^ 

THLDWLKPALTGAGEFDKFLENDDDKIKSALDHIKTQLDSCNGDQAEQQKTTFKTWTEFFKNGDIDNFATGAVSN 
CNNGG 

tOSA.aa BB024 

ClSrSNDNDTLKNNAQQTKRRGKRDLSlTQKETTQEKPKSKEELLREKLSDDQKTHLDWLKPALTGAGEFDKFLEr^ 
DKIKSALDHIKTQLDSCNGDQAEQQKTTFK1WTEFFKNGDIDNFATGAVSNCNNGG 

f09A.nt BB025 

TGAATATTAATAATAAAAAAAGGAATAATAATGAAAATTATCAACATATTATTTTGTTTATTTTTACTAATGCTAA 
ACGGCTGTAATTCTAATGATACTAATAATAGCCAAACAAAAAGTAGACAAAAACGTGATTTAACCCAAAAAGAAGC 
AACACAAGAAAAACCTAAATCTAAAGAAGAACTTCTTAGAGAAAAGCTAAATGATAATCAAAAAACACACCTTGAC 
TGGTTAAAAGAAGCTCTGGGCAATGATGGAGAATTTAATAAATTTTTAGGATATGATGAAAGCAAAATAAAATCTG 
CACTTGATCATATAAAGAGTGAACTTGACAGTTGTACTGGAGATAAGGTTGAAAATAAAAATACCTTCAAGCAGGT 
CGTTCAGGAGGCCCTTAAAGGGGGCATAGACGGCTTTGAAAATACTGCAAGTAGTACGTGCAAAAATTCATAA 

t09A.nt BB025 

TGTAATTCTAATGATACTAATAATAGCCAAACAAAAAGTAGACAAAAACGTGATTTAACCCAAAAAGAAGCAACAC 
AAGAAAAACCTAAATCTAAAGAAGAACTTCTTAGAGAAAAGCTAAATGATAATCAAAAAACACACCTTGACTGGTT 
AAAAGAAGCTCTGGGCAATGATGGAGAATTTAATAAATTTTTAGGATATGATGAAAGCAAAATAAAATCTGCACTT 
GATCATATAAAGAGTGAACTTGACAGTTGTACTGGAGATAAGGTTGAAAATAAAAATACCTTCAAGCAGGTCGTTC 
AGGAGGCCCTTAAAGGGGGCATAGACGGCTTTGAAAATACTGCAAGTAGTACGTGCAAAAATTCA 

f09A.aa BB025 

ILIIKKGIIMKIINILFCLFLLMLNGCNSNDTISnsrSQTKSRQKRDLTQKEATQEKPKSKEELLREKL^ 
LKEALGNDGEFNKFLGYDESKIKSALDHIKSELDSCTGDKVENKNTFKQWQEALKGGIIX5FENTASSTCKNS 

t09A.aa BB025 

CNSITOTNNSQTKSRQKRDLTQKEASITQEKPKSKEELLREKLNDNQKTHLDWLKEALGNT^ 

ALDHIKSELDSCTGDKVENKNTFKQWQEALKGGIDGFENTASSTCKNS 
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2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent d 


4.60E-15I 


3.50E-13I 


6.20E-13I 


5.00E-11I 


1.30E-68 


5.80E-24 


1.90E-35 


2.70E-35 


2.70E-35 


3.60E-35 


3.60E-35 


5.20E-26 


6.50E-23 


6.50E-23 


1.20E-i9 


2.90E-23 


6.90E-23 


2.20E-22 


l.OOE-18 


7.50E-18 


4.80E-16 


6.20E-13 


l.lOE-12 


2.80E-12i 


2.30E-11 


2.40E-11 


6.80E-30 


3.80E-229 


1.90E-117 


3.20E-96 


7.90E-16 
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1253 
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in 
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Antibiotic potentiating peptide #3. 


B. burgdorferi OspF operon. 


B. burgdorferi OspG lipoprotein. 


Trypanosoma cmzi antigen. | 


B. burgdorferi OspF operon. | 


|B. burgdorferi OspG lipoprotein. | 


Ethylene response (ETR) mutant protein etrl-3. 


Ethylene response (ETR) gene product. | 


Ethylene response (ETR) mutant protein etrl-4. 


Ethylene response (ETR) mutant protein etrl-1. 


Ethylene response (ETR) mutant protein etrl-2. 


QETR ethylene response (ETR) protein from Arabidopsis thaliana. 


Tomato ethylene response (TETR) protein. 


Nr (never ripe) tomato ethylene response (ETR) protein. 


Tomato TGETRl ethylene response protein. 


Human thyrotropin GPR N-terminal sequence. 


B. burgdorferi OspG lipoprotein. 


Human follicle stimulating hormone GPR N-terminal sequence. 


Mannose-1 -phosphate transferase protein MNN4. 


Leucocytozoan protozoa structural protein epitope. 


Chicken leucocytozoan immunogenic protein for use in vaccines. 


Antibiotic potentiating peptide #3. 


Plasmodium falciparum ABRA gene protein. 


Plasmodium falciparum bloodand liver stage ABRA antigen. 


Human pi 60 polypeptide 160.2. 


Human pi 60 polypeptide 160.1. 


Staphylococcus aureus Gene #1 polypeptide sequence 2. 


CO 

ON 
CO 

CU 


P39-beta. 


|P39-beta. 


|P39-alpha. 


W21591 1 


R69629 1 


R89476 1 


W26536 1 


R69629 1 


R89476 1 


R69852 1 


R69849 
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m 
oo 
as 
vo 


R69850 


R69851 


R74632 


ON 
CS 
vo 


R74633 


R74630 


W03626 


R89476 


W03627 


W30763 


R70491 


R97866 


W21591 


R68838 


R27530 


W31186 


W31185 


W24296 


R31013 


R33280 


R33280 


1R31013 


|f5-14.aa 


|f5-14.aa 


|f5-14.aa 
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CO 
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|f51-2.aa 
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f51-2.aa 


f51-2.aa 


|f517.aa 


f541.aa 


|f541.aa 


|f542.aa 
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TABLE 3. Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucine 




Valine 


Polar 


Glutamine 




Asparagine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 
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TABLE 4. Residues Comprising Epito-Bearing Fragments 



Query 


Residues Comprising Epito-Bearing Fragments 


flOl.aa 


from about Lys-62 to about Gly-64, from about Ser-1 1 1 to about 
Asp-1 13, from about Arg-136 to about Arg-139, from about Pro- 
icsy to aoout Asn-ivj. 


fll.aa 


from about Pro-38 to about Lys-40, from about Glu-92 to about 
Lys-Vo. 


il2.aa 


irom about rro-Zoo to about Asp-zyu, trom aoout Asn-o-5o to 
about Gly-338, from about Tyr-410 to about Gly-413, from about 

from 




about Gln-643 to about Asp-645, from about Gin- 1061 to about 
/\rg-iUDj, iTom aooui /\sn-i ldsj lo aooui i-*yb-i ux. 


fl29.aa 


from about Glu-76 to about Arg-81, from about Lys-144 to about 
/\sn- i*+o. 


fl47.aa 


from about Gln-94 to about Thr-96. 


ilDz.aa 


irom about Ljiy-JD to about oiy-j /, irom aoout urm-ozi to aooui 
oiy-jZj. 


fl54.aa 


from about Asn-39 to about Lys-41, from about Ser-74 to about 

T \7C 77 frr\m QV*r*nf ^f^r-'? 1 fr* nKfMit r^l\/-91S frnm aHnnt ^f^r-'^O'^ 

jL*ys- / /, rroni auoui ^cr-zij lo douui vjiy~zi*j, iruiii auuuL oci~-7V7j 
tn aHniit AQT>-'^n6 frnm al^niit A^n-^.'?'? tn fiHniit A^n-4'24 


fl57.aa 


from about Lys-21 to about Asp-24, from about Ser-45 to about 

Tvr-47 


fl7.aa 


from about Arg-17 to about Asn-20, from about Thr-94 to about 


f 1 O/C o;) 

I loD.aa 


frnm i^hnnt T vq-'^OS fn ^hniit Tvr-'^OR 




frr*m 'iHoiit T vc-l'?! tn i^Hnnt Acn-19'^ frnm Jihniit Prn-978 tn about 

Lys-282, from about Glu-576 to about Tyr-578. 


f899.aa 


from about Asn-174 to about Asp- 177. 


f925.aa 


from about Lys-201 to about Asp-204, from about Phe-291 to 
aboul i-*ys-z^:^'i. 


ryzy.aa 


irom aooul rro-i^y lo aooui Asn-i'f i, irom <idoui /\rg-zi i lu auuui 

nin-Ol A frnm aViniit Thr-'^7n tn ahnnt A<:n-'^7S 


f933.aa 


from about Ser-1 39 to about Lys-143. 


rV4U,aa 


irom aooui vjiy-i*f j lo aooui /\sn-i*fo. 


f943.aa 


from about Asp-58 to about Asp-60, from about Lys-157 to about 
Asn-159, from about Asp-217 to about Asp-221, from about Lys- 
zju to aooui /\sn-z->H, irom duoui i ro-zoz lo auuuL /\^ii-zuh-, iilhh 
about GIv-305 to about Trp-307. 


173'^. aa 


frr\nr» oVvrf^iit Q^i"-.^7 fr\ s*V*r\iit ^^f_^4 
irUIll dULiUL OCl-J^ lU aUULIl OCl-J*+. 


f4.aa 


from about Arg-64 to about Arg-67. 


i4J.aa 


Irom aoout oer-o^ lo aooui oin-o/, irom aooui Asp-zji lo aooui 
Tyr-233, from about Arg-296 to about Asp-300, 


fSO.aa 


from about Glu-136 to about Gly-138, from about Asp- 153 to 

aKr\nt T \/c-1 frr%Tn nKriiit A«r%-7RO tn nViniit AQn-901 frnm aHnnt 
cLOkjxXI J-ryaUJ, lIUlll aUV^Ul AVop ZOj^ %\J a\J\JKl%. r\^^^J~^^ I , kHJiii o.\j\JU\. 

Glu-458 to about Asn-461. 


f65.aa 


from about Glu-120 to about Asp- 122, from about Pro-204 to about 
Tyr-206. 


fS.aa 


from about Pro-263 to about Arg-265, from about Asp-274 to about 
Lys-278. 


f82.aa 


from about Tyr-66 to about Gly-68, from about Ser-1 16 to about 
Lys-1 19, from about Asp-121 to about Gly-123, from about Pro- 
128 to about Gly-131. 
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loo.aa 


irom aDoui /\sn-i /y lo aDOUi /\sn-ioi, irom aooui i_^ys-iyz co 
about Asn-194, from about Lys-270 to about Asn-272, from about 


f477.aa 


from about Pro- 250 to about Lys-253. 


i4oo.aa 


trom aDout Lys-zo to aooui Lrys-zy, irom aooui Asn-^oo co aooui 
Asp-489, from about Lys-508 to about Gly-510, from about Asn- 


f494.aa 


from about Lys-76 to about Asn-78. 


iDlo.aa 


from about Lys-32 to about Asp-34. 


f523.aa 


from about Pro-202 to about Asn-206, from about Lys-255 to about 


iDzo.aa 


rrom aooui /\sn-oD lo aooui j-rys-oo, rrom aooui /\sp-i jo lo aooui 
vjiy-i JO. 


f577.aa 


from about Cys-18 to about Lys-22, from about Asn-297 to about 
Gln-300. 


iDo4.aa 


rrom aoout Jrro-ui lo aooui Lys-ioD^ rrom aooui rro-zuu lo aooui 


f596.aa 


from about Arg-42 to about Asp-44, from about Asp- 1 17 to about 
Tyr-1 19, from about Pro-205 to about Asp-207, 


louu.aa 


from about Pro- 143 to about Asp- 145. 


f603.aa 


from about Phe-35 to about Ser-37. 


i607.aa 


from about Gln-o7 to about Lys-70, from about Asp-273 to about 
lyr-z/D, rrom aoout Asp-^Jo lo aooui oiy-3:)o, rrom aooui rro- 
359 to about Lys-362, from about Arg-409 to about Gly-41 1. 


loll.aa 


irom about Arg-l3J to aoout Lriy-i J!>, 


f631.aa 


from about Pro- 132 to about Asn-136, from about Asn-159 to 
about Tyr-161, from about Pro-216 to about Asp-218, from about 
rro-zzU to about Lys-zzJ. 


f688.aa 


from about Lys-266 to about Asp-268, from about Lys-271 to 
about Asn-z / J, rrom about Lys-oiD to aooui i^ys-oio. 


f704.aa 


from about Lys-250 to about Lys-253. 


f707.aa 


from about Lys-131 to about Asp- 134, from about Asp-246 to 
about Asn-249. 


f709.aa 


from about Tyr-39 to about Cjly-4z, rrom about Lys-14o to about 
Cjly-lDU, irom about Arg-zoy to about oiy-z/z, rrom aooui oer- 
Hoo to aDOUi 1 yr"^oo, rrom dooui /\an-Ho^ wj duuui /\Mi-*T^r, iiviiii 




aooui i^ys-j / j lo dooui /^.ap-j /o, rrom auouL r ro-tr+z- lu u.u(jul j-rya- 
644 


fl97.aa 


from about Pro-217 to about Asp-219, from about Glu-675 to about 
A<;n-fi78 frnm fiVioiit Prn-6R7 to aHoiit A^n-68Q from about Glu- 

694 to about Gln-696. 


f200.aa 


from about Arg-174 to about Phe-179. 


f208.aa 


from about Arg-326 to about Ser-328. 


f210.aa 


from about Pro- 191 to about lie- 194. 


f221.aa 


from about Asn-133 to about Asn-135. 


f253.aa 


from about Arg-191 to about Gly-194. 


f269.aa 


from about Ser-271 to about Thr-273, from about Asp-284 to about 
Gly-286. 


f29.aa 


from about Pro- 159 to about Ser-161. 


f290.aa 


from about Pro-240 to about Gly-244. 


f291.aa 


from about Gln-267 to about Lys-269. 
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f296.aa 


from about Glu-98 to about Lys-101. 






f30.aa 


from about Asn-156 to about Tyr-159, from about Asn-178 to 

ahnnt T v<5-l RO 


f939.aa 


from about Ser-245 to about Asn-249. 


f739.aa 


from about Asn-80 to about Tyr-82, from about Lys-208 to about 


f742.aa 


from about Ser-141 to about Asp- 145, from about Asn-222 to about 
Gln-225, from about Asp-243 to about Tyr-247, from about Asn- 


£743.aa 


from about Arg-1 1 1 to about Gly-1 14, from about Pro- 131 to about 


f790.aa 


from about Thr-40 to about Asn-42, from about Ser-53 to about 

OCr-^*>, irom aUOUl i^ya-Zl J 10 aOUUL /\ap*Z10, irUIIi aXjyjiXV AV5U-x-/*T 

to about Gly-277. 


i/yz.aa 


irom aooui vai-oz lo auoui oer-o*f, irom auoui ocr-iu-i- lo d.uuuL 
Asn-104, from about Gin- 127 to about Tyr-130, from about Lys- 

'^00 fo ahonf Acn-'^14 frnm ahniit T v<;-'^7S to ^ihnnt Thr-^77 from 




5iHoiit Pro-S 1 1 to nhoiit HiQ-Sl firom aHoiit T'hr-SI S to fiHoiit A<;r)- 

517. 




from flhoiit Pro-1 IQ to about Glv-T^^ from about T.v<;-lfi6 to about 
Asn-169. 


f700 afl 


from Ahoiif A^n-"^! to about Asn-'^4 from about Gln-44 to about 
Asn-47, from about Pro- 123 to about Giy-125. 




from ahoiit ^f»r-19n to about .Sler-1 79 from about Arp-fi'^fi to about 

Asn-638, from about Cys-967 to about Ser-969. 




from about Thr-S6'^ to about Tvr-S6S 




from about Tvr-1 59 to about Tvr-164 from about Gln-375 to about 
Asp-379. 


fSS*^ aa 


from about Thr- 1 80 to about Lvs- 1 84 from about Are-23 1 to about 
Asp-233, from about Asn-252 to about GIy-254. 


f859.aa 


from about Lys-46 to about Ser-52, from about Pro-88 to about 

Acn-Q1 from about A^n-1 17 to about A<;o-190 

V^oll 7 X , XI will CL\J\J\X\. ^HLOll XX/ WJ H.L^V/LAl I^OyJ X^V* 


f861.aa 


from about Asp-38 to about Lys-40, from about Lys-219 to about 

AQn-99S 


f368.aa 


from about Gln-228 to about Asn-23 1 . 


f371.aa 


from about Tyr-109 to about Asn-1 1 1, from about Asn-162 to 
about Gin- 164. 


iDOz.aa 


irom about Asn-l itS to about l^ys-lzz, rrom aoout oer-zoy lo aoout 
Gly-271, from about Lys-370 to about Asp-373, from about Asn- 
D\j)7 lo aooui i^ys-jx i, rrom aooui i-»ys- /kjd lo douui /\rg-/u/, xiuixi 




auoui inr-^1^ lO aOOUi vjiy-yiH-, rrom aooui riU'i^LD lu auuui 
Asp-1216, from about Asn-1491 to about Arg-1493. 


f527 aa 


from about Gvs-20 to about Gln-22 from about Asn-38 to about 
Asn-40, from about Phe-1 12 to about Asp-1 14, from about Lys-160 
to about Asn-162, from about Ser-199 to about Asp-201, from 
about 




Gln-258 to about Gly-261, from about Arg-282 to about Asn-284, 
from about Ser-297 to about Asp-299. 


f541.aa 


from about Ser-68 to about Asn-71. 


f604.aa 


from about Lys-77 to about Gly-79, from about Lys-201 to about 
Asn-203, from about Asp-252 to about Asp-254, from about Tyr- 
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o't / lO aDOUl oiy-oju, irom auoui /vsp-ji'f lo aDoui irp-Jio. 


I /Jo.aa 


irom aDOUl Lys-zu lo aooui /\sn-ZH-, irom aooui /vrg-i^/ lo auoui 
Ser-153, from about Ser-231 to about Lys-233. 


I /DZ.aa 


irom auoui inr-i ly lo aouui i^yb-izZy irom aooui r^ro"*f^v/ lo auuui 


f798.aa 


from about Asp-33 to about Thr-36, from about Lys-180 to about 


f635.aa 


from about Pro- 100 to about Asn-102, from about Asp- 145 to 
ahniit Php-147 


f32.aa 


from about Lys-18 to about Asn-20. 


uzu.aa 


irom aooui /\sn-iyj> lo apuui lyo^ irom aDuui oin-zHo lu 
about Lys-250. 


i^Dz.aa 


irom aooui oer—fo lo d.Doui 


OOl.aa 


from about Lys-178 to about Lys-180, from about Ser-401 to about 

T\rf Af\A 


ij /3.aa 


irom aooui ijjy-oo lo auoui i->ys-yu, irom auoui /\sn-Djy lo aooui 

j^ya-^Hii., iiijiii clukjuk. vjiu-uj*t lu cil/oui ocx vjj /. 


f384.aa 


from about Pro-250 to about Asn-252, from about Asp-266 to 


f446.aa 


from about Asp-20 to about Ser-26, from about Asn-146 to about 




-ffrtm aT^rMif" Afo— tr% aVioiit r^lv-RR frr\m sihr^iit Afc— 1^*^ tr\ Jihmit 

IxUIll aUKjUkl /AXg OVJ LU aULlUL VJljr-OOy iiVJlll aUXJiXL J^±^ l\J <XU\Jilt 

Asn-165 


f93.aa 


from about Asn-152 to about Asp- 155. 


f 1 05 aa 


from about A<;n-48 to about Phe-50 


fl50 aa 


from about Thr-214 to about Aso-218 from about Asd-256 to 
about Asp-259. 


f219.aa 


from about Asn-77 to about Asn-81, from about Asp-1 1 1 to about 
Asn-115. 


f229.aa 


from about Gln-61 to about Asn-63. 


f32.aa 


from about Lys-18 to about Asn-20. 


11 OO.dd 


lIUlll dOUUl l^y^-jy/J IXf a.UUUL lyi-JV/O. 


f216.aa 


from about Ser-105 to about Asn-107. 


f328.aa 


from about Asn-105 to about Asp- 107. 


T352.aa 


from about Ser-46 to about Asn-49. 


f867.aa 


from about Thr-3 to about Gly-5, from about Lys-156 to about Ser- 
159. 


fS68.aa 


from about Arg-94 to about Gly-96, from about Pro-257 to about 
Oly-zol, iTom about rro-zyj to about Asp-zv /, irom about Arg- 
340 to about Asp-342. 


to /2.aa 


irom atx)ut oer-iy to about Lys-Z3, irom about inr-i JV to about 
/vsp- m-^j irom aooui oer-zo-^i lo aooui i yr-zoo, irom auoui oci- o 1 1 
to about Ser-313. 


f886.aa 


from about Thr-83 to about Asp-85, from about Asp- 106 to about 
Lys-108, from about Lys-143 to about Gly-147, from about Asp- 
186 to about Asn-191. 


f888.aa 


from about Asn-65 to about Asp-67. 


f893.aa 


from about Asn-203 to about Asn-207, frx>m about Thr-446 to 
about Asn-450. 


f605.aa 


from about Arg-3 1 to about Asp-33. 


f606.aa 


from about Asn-68 to about Gly-71, frx)m about Asn-136 to about 
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Lys-ljy, irom about Asn-223 to about lyr-22o, rrom about ber- 
276 to about Tyr-279, from about Pro-362 to about Asn-365, from 




about Arg-5UJ to about lrp-5U7. 


f679.aa 


from about Lys-154 to about Asp- 156, from about Lys-224 to 
about Arg-226, from about Asn-260 to about Asp-264, from about 
oju-aoj to aoout i-fys-joo, rrom aoout AspOo/ to aooui Criy-^oy, 
from 




about Tyr-441 to about Lys-443, from about Arg-501 to about Tyr- 


fll-12.aa 


from about Pro-91 to about Asn-93, from about Pro-181 to about 
Asp-loo, irom aooui Lys-z^f^ ro aooui oer-Z'fo. 


fll-4.aa 


from about Asn-160 to about Lys-163. 


114-o.aa 


irom about rro-92 to about Oln-yD, rrom about JLys-123 to about 
inr-izj, irom aooui i-»ys-zi3 lo aooui /vsp-ziy. 


fl7-6.aa 


from about Pro-36 to about Glu-38. 


iiy-z.aa 


irom about oer-104 to about aer-lOo, rrom about Oln-z30 to about 
Asn-232. 


•fid A 


irom about vai-/v to about iiir-o2, rrom about rro-iyj to about 
Gly-201. 


iiy-o.aa 


from about Asp-24 to about Lys-30, from about Pro-36 to about 
Glu-38. 


izi-4.aa 


from about Cys-24 to about Asn-26. 




irom aoout oer- / / to aoout i^ys-ou, irom aoout i yr-z to aooui 
Asn-277. 


izo-j.aa 


from about Glu-53 to about Arg-57, from about Gln-82 to about 

Aon fi^ -ff/^m oVtrMif /^1n 1 fr\ oKrxiif Aon 1 

/\sn-o^, irom aooui om-i j / lo aooui /\sn-i jy. 


f31-2.aa 


from about Arg-95 to about Arg-97, from, about Asn-297 to about 

A cn OOO 

/VSn-Z!7!7. 


fA 1 <\ oo 

iD.aa 


rrom aoout rro-ioz to aoout Asp-ion-, rrom aoout i^ys-zzu to aoout 
Asp-222. 


f4-50.aa 


from about Thr- 1 09 to about Asn- 111. 


i42-l.aa 


rrom about Asn-5D to about Asn-37, rrom about Arg-ol to about 
Ser-84, from about Asp-94 to about Asn-97. 


i4!)-z.aa 


rrom about Asn-o3 to about Oly-oo. 


t4/-z.aa 


from about Ser-29 to about Asp-33, from about Asn-94 to about 
Lys-99, from about Pro- 152 to about Lys-157. 




from about Asn-452 to about Gly-454. 


f5-14.aa 


from about Glu-102 to about Asp- 106, from about Thr-272 to about 
Asn-275, from about Glu-313 to about Asn-315, from about Ser- 
J / u to aoout oer-:5 /z. 


f <; 1 oo 


from about Lys-170 to about Gly-173, from about Asn- 194 to 
about Gly-196. 


i3i-z.aa 


rrom about Asj>-3U2 to about Lrys-3u4. 


lU Z 1 •da 


liUIil aDUUl VJIU-JO to auUUl /\an-H'Z, irom dDUUL /\rg-0*f lO aUUUL 

Gly-87. 


f6-27.aa 


from about Asp-67 to about Asn-69, from about Arg-85 to about 
Asn-89, from about Lys-168 to about Gly-171, from about Lys-179 
to about Asn-181, from about Ser-380 to about His-382. 


f6-5.aa 


from about Ser-67 to about Asn-71. 


f7-30.aa 


from about Pro-94 to about Asp-96, from about Lys-144 to about 
Arg-147. 


f76-l.aa 


from about Asn-30 to about Lys-35, from about Lys-1 13 to about 
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Gly-1 16, from about Glu-1 19 to about Lys-121. 


f8-10.aa 


from about Pro-25 to about Lys-32, from about Ser-168 to about 
Thr-172. 


f01a.aa_bb001 


from about Pro- 123 to about Asp-125, from about Ser-179 to about 
Asp- 181, fixjm about Lys-255 to about Gly-259. 


_bb001 1 


from about Ala8 about Arg 17, from about Tyr31 to about Gly40, 
from about Ser65 to about Lys78, from about Val93 to about 
Aspl02, from about Serl20 to about Ilel29, from about Prol56 to 
about Glul70, from about Lysl87 to about Asn 196, from about 
His205 to about Lys214, from about Gly226 to about Glu235, fro 
about Gln253 to about Asn266, from about Glu283 to about 
Glu293, from about Leu31 1 to about De320, from about Arg326 to 
about Gly335, from about Pro340 to about Ala349 


fO2a.aa_bb002 


from about Tyr-169 to about Asn- 171, from about Tyr-242 to about 
Asn-245, from about Lys-264 to about Asp-267. 


_bb9 


from about Met? to about Lysl6, from about Lys47 to about Ser57, 
from about AsnSO to about Ser89, from about Gly 103 to about 
Glul 13, from about Lysl25 to about Prol33, from about Lysl38 
to about Ala 147 


fD3a.aa_bb006 


from about Asp-54 to about Thr-57, from about Lys-201 to about 
His-204. 


_bb014 


from about Pro23 to about Gln31, from about Ser37 to about 
Asp45, from about Leu76 to about Asn84, from about Leu76 to 
about Val84, from about Ser89 to about Asn97, from about SerlOS 
to about Lysl 13, from about Asn 120 to about Met 128, from about 
Asn 159 to about Gly 167, from about Lysl73 to about BallSl 


_bb023 


from about Asp 17 to about Gly 27, from about Arg40 to about 
Asp48, from about Val64 to about Asp72, from about GlulOS to 
about Thrl 13, from about Serl41 to about Gly 150, from about 
Aspl55 to about Ilelo3, irom about Asnl84 to about Lysl98, irom 
about Ile219 to about Pro227, from about Ser230 to about Phe238, 
from about Ser241 to about Asn250, from about Asp270 to about 
Val278, from about Ser285 to about Leu293, from about Glyu307 
to about Ser315, from about Lys327 to about Asn335 


f08a.aa_bb024 


from about Asn-30 to about Asp-33, from about Ser-1 16 to about 
Asn- 1 18, from about Asn- 154 to about Gly- 156. 


fD9a.aa_bb025 


from about Asn-30 to about Ser-35, from about Thr-145 to about 
Asn-148. 
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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having a nucleotide sequence 
selected from the group consisting of: 

(a) a nucleotide sequence encoding any one of the amino acid sequences of the 
polypeptides shown in Table 1; or 

(b) a nucleotide sequence complementary to any one of the nucleotide sequences in (a). 

(c) a nucleotide sequence at least 95% identical to any one of the nucleotide sequences 
shown in Table 1; or, 

(d) a nucleotide sequence at least 95% identical to a nucleotide sequence complementary to 
any one of the nucleotide sequences shown in Table 1. 

2. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which hybridizes 
under stringent hybridization conditions to a polynucleotide having a nucleotide sequence identical 
to a nucleotide sequence in (a) or (b) of claim 1. 

3. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which encodes an 
epitope-bearing portion of a polypeptide in (a) of claim 1. 

4. The isolated nucleic acid molecule of claim 3, wherein said epitope-bearing portion of a 
polypeptide comprises an amino acid sequence listed in Table 4. 

5. A method for making a recombinant vector comprising inserting an isolated nucleic acid 
molecule of claim 1 into a vector. 

6. A recombinant vector produced by the method of claim 5. 

7. A host cell comprising the vector of claim 6. 

8. A method of producing a polypeptide comprising: 

(a) growing the host cell of claim 7 such that the protein is expressed by the cell; and 

(b) recovering the expressed polypeptide. 

9. An isolated polypeptide comprising a polypeptide selected from the group 
consisting of: 
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(a) a polypeptide consisting of one of the complete amino acid sequences of Table 1 ; 

(b) a polypeptide consisting of one the complete amino acid sequences of Table 1 except 
the N-teraiinal residue; 

(c) a fragment of the polypeptide of (a) having biological activity; and 

(d) a fragment of the polypeptide of (a) which binds to an antibody specific for the 
polypeptide of (a). 

10. An isolated antibody specific for the polypeptide of claim 9. 

11. A polypeptide produced according to the method of claim 8. 

12. An isolated polypeptide comprising an amino acid sequence at least 95% identical to a 
sequence selected from the group consisting of an amino acid sequence of any one of the 
polypeptides in Table 1. 

13. An isolated polypeptide antigen comprising an amino acid sequence of an B, burgdorferi 
epitope shown in Table 4. 

14. An isolated nucleic acid molecule comprising a polynucleotide with a nucleotide sequence 
encoding a polypeptide of claim 9. 

15. A hybridoma which produces an antibody of claim 10. 

16. A vaccine, comprising: 

(1) one or more B. burgdorferi polypeptides selected from the group consisting of a 
polypeptide of claim 9; and 

(2) a pharmaceutically acceptable diluent, carrier, or excipient; 

wherein said polypeptide is present, in an amount effective to elicit protective antibodies 
in an animal to a member of the Borrelia genus. 

17. A method of preventing or attenuating an infection caused by a member of the Borrelia genus 
in an animal, comprising administering to said animal a polypeptide of claim 9, wherein said 
polypeptide is administered in an amount effective to prevent or attenuate said infection. 

18. A method of detecting Borrelia nucleic acids in a biological sample comprising: 

(a) contacting the sample with one or more nucleic acids of claim 1, under conditions 
such that hybridization occurs, and 

(b) detecting hybridization of said nucleic acids to the one or more Borrelia nucleic acid 
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19. A method of detecting Borrelia nucleic acids in a biological sample obtained from an animal, 
comprising: 

(a) amplifying one or more Borrelia nucleic acid sequences in said sample using 
polymerase chain reaction, and 

(b) detecting said amplified Borrelia nucleic acid. 

20. A kit for detecting Borrelia antibodies in a biological sample obtained from an animal, 
comprising 

(a) a polypeptide of claim 9 attached to a solid support; and 

(b) detecting means. 

21. A method of detecting Borrelia antibodies in a biological sample obtained from an animal, 
comprising 

(a) contacting the sample with a polypeptide of claim 9; and 

(b) detecting antibody-antigen complexes. 



